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A NEW METHOD FOR TAPPING 
THE IMMUNOLOGICAL REPERTOIRE 

Description 

5 

Technical Field 

The present invention relates to a method 
for isolating a gene coding for a receptor having a 
preselected activity. 

10 

Background 

Binding phenomena between ligands and 
receptors play many crucial roles in biological 
systems. Exemplary of such phenomena are the 

15 binding of oxygen molecules to deoxyhemoglobin to 
form oxyhemoglobin, and the binding of a substrate 
to an enzyme that acts upon it such as between a 
protein and a protease like trypsin. Still further 
examples of biological binding phenomena include 

20 the binding of an antigen to an antibody, and the 

binding of complement component C3 to the so-called 
CR1 receptor. 

Many drugs and other therapeutic agents 
are also believed to be dependent upon binding 

25 phenomena. For example , opiates such as morphine 
re reported to bind to specific receptors in the 
brain. Opiate agonists and antagonists are 
reported to compete with drugs like morphine for 
those binding sites. 

30 Ligands such as man-made drugs, like 

morphine and its derivatives, and those that are 
naturally present in biological systems such as 
endorphins and hormones bind to receptors that are 
naturally present in biological systems, and will 

35 be treated together herein. Such binding can lead 
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to a number of the phenomena of biology , including 
particularly the hydrolysis of amide and ester 
bonds as where proteins are hydrolyzed into 

constituent polypeptides by an enzyme such as * 
5 trypsin or papain or where a fat is cleaved into 

glycerine and three carboxylic acids, respectively. % 
In addition, such binding can lead to formation of 
amide and ester bonds in the formation of proteins 
and fats, as well as to the formation of carbon to 

10 carbon bonds and carbon to nitrogen bonds. 

An exemplary receptor-producing system in 
vertebrates is the immune system. The immune 
system of a mammal is one of the most versatile 
biological systems as probably greater than 1.0 x 

15 10 7 receptor specificities, in the form of 

antibodies, can be produced. Indeed, much of 
contemporary biological and medical research is 
directed toward tapping this repertoire. During 
the last decade there has been a dramatic increase 

20 in the ability to harness the output of the vast 
immunological repertoire. The development of the 
hybridoma methodology by Kohler and Milstein has 
made it possible to produce monoclonal antibodies, 
i.e., a composition of antibody molecules of a 

25 single specificity, from the repertoire of 

antibodies induced during an immune response. 

Unfortunately, current methods for 
generating monoclonal antibodies are not capable of 
efficiently surveying the entire antibody response 

30 induced by a particular immunogen. In an 

individual animal there are at least 5-10,000 

different B-cell clones capable of generating * 
unique antibodies to a small relatively rigid 

immunogens, such as, for example dinitrophenol . * 
35 Further, because of the process of somatic mutation 
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during the generation of antibody diversity, 
essentially an unlimited number of unique antibody 
molecules may be generated. In contrast to this 
vast potential for different antibodies, current 
5 hybridoma methodologies typically yield only a few 
hundred different monoclonal antibodies per fusion. 

Other difficulties in producing 
monoclonal antibodies with the hybridoma 
methodology include genetic instability and low 

10 production capacity of hybridoma cultures. One 
means by which the art has attempted to overcome 
these latter two problems has been to clone the 
immunoglobul in-producing genes from a particular 
hybridoma of interest into a prokaryotic expression 

15 system. See, for example, Robinson et al., PCT 

Publication No. WO 89/0099; Winter et al., European 
Patent Publication No. 0239400; Reading, U.S. 
Patent No. 4,714,681; and Cabilly et al., European 
Patent Publication No. 0125023. 

20 The immunologic repertoire of vertebrates 

has recently been found to contain genes coding for 
immunoglobulins having catalytic activity. 
Tramontano et al., Sci. , 234:1566-1570 (1986); 
Pollack et al., Sci. . 234:1570-1573 (1986); Janda 

25 et al., Sci. . 241:1188-1191 (1988); and Janda et 

al., Sci. , 244:437-440 (1989). The presence of, or 
the ability to induce the repertoire to produce, 
antibodies molecules capable of a catalyzing 
chemical reaction, i.e., acting like enzymes, had 

30 previously been postulated almost 20 years ago by 
W. P. Jencks in Catalysis in Chemistry and 
Enzvmolocrv . McGraw-Hill, N.Y. (1969). 

It is believed that one reason the art 
failed to isolate catalytic antibodies from the 

35 immunological repertoire earlier, and its failure 
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4 

to isolate many to date even after their actual 
discovery, is the inability to screen a large 
portion of the repertoire for the desired activity. 
Another reason is believed to be the bias of * 
5 currently available screening techniques, such as 

the hybridoma technique, towards the production * 
high affinity antibodies inherently designed for 
participation in the process of neutralization, as 
opposed to catalysis. 



Brief Summary of the Invention 

The present invention provides a novel 
method for screening a larger portion of a 
15 conserved receptor coding gene repertoire for 

receptors having a preselected activity than has 
heretofore been possible, thereby overcoming the 
before-mentioned inadequacies of the hybridoma 
technique* 

20 In one embodiment, a conserved receptor- 

coding gene library containing a substantial 
portion of the conserved receptor-coding gene 
repertoire is synthesized. In preferred 
embodiments, the conserved receptor-coding gene 

25 library contains at least about 10 3 , preferably at 
least about 10 4 and more preferably at least about 
10 5 different receptor-coding genes. 

The gene library can be synthesized by 
either of two methods, depending on the starting 

30 material. 

Where the starting material is a 
plurality of receptor-coding genes, the repertoire 
is subjected to two distinct primer extension 
reactions. The first primer extension reaction 
35 uses a first polynucleotide synthesis primer 



capable of initiating the first reaction by 
hybridizing to a nucleotide sequence conserved 
(shared by a plurality of genes) within the 
repertoire. The first primer extension produces of 
different conserved receptor-coding homolog 
compliments (nucleic acid strands complementary to 
the genes in the repertoire) . 

The second primer extension reaction 
produces, using the complements as templates, a 
plurality of different conserved receptor-coding 
DNA homologs. The second primer extension reaction 
uses a second polynucleotide synthesis primer that 
is capable of initiating the second reaction by 
hybridizing to a nucleotide sequence conserved 
among a plurality of the compliments. 

Where the starting material is a 
plurality of compliments of conserved receptor- 
coding genes, the repertoire is subjected to the 
above-discussed second primer extension reaction. 
Of course, if both a repertoire of conserved 
receptor-coding genes and their complements are 
present, both approaches can be used in 
combination. 

A conserved receptor-coding DNA homolog, 
i.e., a gene coding for a receptor capable of 
binding the preselected ligand, is then segregated 
from the library to produce the isolated gene. 
This is typically accomplished by operatively 
linking for expression a plurality of the different 
conserved receptor-coding DNA homologs of the 
library to an expression vector. The receptor- 
expression vectors so produces are introduced into 
a population of compatible host cells, i.e., cells 
capable of expressing a gene operatively linked for 
expression to the vector. The transformants are 
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cultured under conditions for expressing the 
receptor corded for by the receptor-coding DNA 
homolog. The transformants are cloned and the 
clones are screened for expression of a receptor 
5 that binds the preselected ligand. Any of the 
suitable methods well known in the art for 
detecting the binding of a ligand to a receptor can 
be used. A trans formant expressing the desired 
activity is then segregated from the population to 

10 produce the isolated gene. 

In another embodiment, the present 
invention contemplates a gene library comprising an 
isolated admixture of at least about 10 3 , preferably 
at least about 10 4 and more preferably at least 10 5 

15 conserved receptor-coding DNA homologs, a plurality 
of which share a conserved antigenic determinant. 
Preferably, the homologs are present in a medium 
suitable for in vitro manipulation, such as water, 
phosphate buffered saline and the like, which 

20 maintains the biological activity of the homologs. 

A receptor having a preselected activity, 
preferably catalytic activity, produced by a method 
of the present invention, preferably a monomer or 
dimer as described herein, is also contemplated. 



Brief Description of the Drawings 

In the drawings forming a portion of this 
disclosure: 

30 Figure 1 Illustrates a schematic 

diagram of the immunoglobulin molecule showing the 
principal structural features. The circled area on 
the heavy chain represents the variable region (V H ) , 
a polypeptide containing a biologically active 

35 (ligand binding) portion of that region, and a gene 
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coding for that polypeptide, are produced by the 
methods of the present invention. Sequences L03, 
L35, L47 and L48 could not be classified into any 
predefined subgroups. 
5 Figure 2A Diagrammatic sketch of an H 

chain of human IgG (IgGl subclass) . Numbering is 
from the N-terminus on the left to the C-terminus 
on the right. Note the presence of four domains, 
each containing an intrachain disulfide bond (S-S) 

10 spanning approximately 60 amino acid residues. The 
symbol CHO stands for carbohydrate. The V region 
of the heavy (H) chain (V„) resembles V L in having 
three hypervariable CDR (not shown) . 

Figure 2B Diagrammatic sketch of a human 

15 K chain (Panel 1) . Numbering is from the N- 
terminus on the left to the C-terminus on the 
right. Note the intrachain disulfide bond (S-S) 
spanning about the same number of amino acid 
residues in the V L and C L domains. Panel 2 shows 

20 the locations of the complementarity-determining 
regions (CDR) in the V L domain. Segments outside 
the CDR are the framework segments (FR) . 

Figure 3 Amino acid sequence of the V H 
regions of 19 mouse monoclonal antibodies with 

25 specificity for phosphorylcholine. The designation 
HP indicates that the protein is the product of a 
hybridoma. The remainder are myeloma proteins. 
(From Gearhart et al., Nature , 291:29, 1981.) 

Figure 4 Illustrates the results 

30 obtained from PCR amplification of mRNA obtained 
from the spleen of a mouse immunized with FITC. 
Lanes R17-R24 correspond to amplification reactions 
with the unique 5 1 primers (2-9, Table 1) and the 
3' primer (12, Table 1), R16 represents the PCR 

35 reaction with the 5 1 primer containing inosine (10, 
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Table 1) and 3' primer (12, Table 1). Z and R9 are 
the amplification controls; control Z involves the 
amplification of V H from a plasmid (PLR2) and R9 
represents the amplification from the constant * 
5 regions of spleen mRNA using primers 11 and 13 
(Table 1) . 

Figure 5 Nucleotide sequences are 
clones from the cDNA library of the PCR amplified V H 
regions in Lambda ZAP. The N-terminal 110 bases 

10 are listed here and the underlined nucleotides 

represent CDR1 (complementary determining region) . 

Figure 6 The sequence of the synthetic 
DNA insert inserted into Lambda ZAP to produce 
Lambda Zap II V H (Panel A) and Lambda Zap V L (Panel 

15 B) expression vectors. The various features 

required for this vector to express the V H and V L - 
coding DNA homologs include the Shine-Dalgarno 
ribosome binding site, a leader sequence to direct 
the expressed protein to the periplasm as described 

20 by Mouva et al., J. Biol. Chem. . 255:27, 1980, and 
various restriction enzyme sites used to 
operatively link the V H and V L homologs to the 
expression vector. The V H expression-vector 
sequence also contains a short nucleic acid 

25 sequence that codes for amino acids typically found 
in variable regions heavy chain (V H Backbone) . This 
V H Backbone is just upstream and in the proper 
reading as the V H DNA homologs that are operatively 
linked into the Xho I and Spe I. The V L DNA 

30 homologs are operatively linked into the V L sequence 
(Panel B) at the Nco I and Spe I restriction enzyme 
sites and thus the V H Backbone region is deleted 1 
when the V L DNA homologs are operatively linked into 
the V L vector. i 

35 Figure 7 The major features of the 
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bacterial expression vector Lambda Zap II V H (V H - 
expression vector) are shown. The synthetic DNA 
sequence from Figure 6 is shown at the top along 
with the T 3 polymerase promoter from Lambda Zap II. 
5 The orientation of the insert in Lambda Zap II is 
shown. The V H DNA homologs are inserted into the 
Xho I and Spe I restriction enzyme sites. The V H 
DNA are inserted into the Xho I and Spe I site and 
the read through transcription produces the 

10 decapeptide epitope (tag) that is located just 3 1 
of the cloning sites. 

Figure 8 The major features of the 
bacterial expression vector Lambda Zap II V L (V L 
expression vector) are shown. The synthetic 

15 sequence shown in Figure 6 is shown at the top 

along with the T 3 polymerase promoter from Lambda 
Zap II. The orientation of the insert in Lambda 
Zap II is shown. The V L DNA homologs are inserted 
into the phagemid that is produced by the in vivo 

20 excision protocol described by Short et al., 

Nucleic Acids Res. , 16:7583-7600, 1988. The V L DNA 
homologs are inserted into the Nco I and Spe I 
cloning sites of the phagemid. 

Figure 9 A modified bacterial 

25 expression vector Lambda Zap II V L II. This vector 
is constructed by inserting this synthetic DNA 
sequence, 

TGAATTCTAAACTAGTCGCCAAGGAGACAGTCATAATGAA 
TCGAACTTAAGATTTGATCAGCGGTTCCTCTGTCAGTATTACTT 

30 

ATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCTG 
TATGGATAACGGATGCCGTCGGCGACCTAACAATAATGAGCGAC 

CCCAACCAGCCATGGCCGAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 
35 GGGTTGGTCGGTACCGGCTCGAGCAGTCAAGATCTCAATTCGCCGGCAGCT 
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into Lambda Zap II that has been digested with the 
restriction enzymes Sac I and Xho I. This sequence 
contains the Shine-Dalgarno sequence (Ribosome 
binding site) , the leader sequence to direct the 
5 expressed protein to the periplasm and the 

appropriate nucleic acid sequence to allow the V L 
DNA homologs to be operatively linked into the SacI 
and Xba I restriction enzyme sites provided by this 
vector. 

10 Figure 10 The sequence of the synthetic 

DNA segment inserted into Lambda Zap II to produce 
the lambda V L II-expression vector. The various 
features and restriction endonuclease recognition 
sites are shown. 

15 Figure 11 The vectors for expressing V H 

and V L separately and in combination are shown. The 
various essential components of these vectors are 
shown. The light chain vector or V L expression 
vector can be combined with the V H expression vector 

20 to produce a combinatorial vector containing both V H 
and V L operatively linked for expression to the same 
promoter. 

Figure 12 The labelled proteins 
immunoprecipitated from E. coli containing a V H and 

25 a V L DNA homolog are shown. In lane 1, the 

background proteins immunoprecipitated from E. coli 
that do not contain a V H or V L DNA homolog are 
shown. Lane 2 contains the V H protein 
immunoprecipitated from E. coli containing only a V H 

30 DNA homolog. In lanes 3 and 4, the co-migration of 
a V H protein a V L protein immunoprecipitated from E. 
coli containing both a V H and a V L DNA homolog is 
shown. In lane 5 the presence of V H protein and V L 
protein expressed from the V H and V L DNA homologs is 

35 demonstrated by the two distinguishable protein 
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species. Lane 5 contains the background proteins 
immunoprecipitated by anti -E. coli antibodies 
present in mouse ascites fluid. 

Figure 13 The transition state analogue 
(formula 1) which induces antibodies for 
hydrolyzing carboxamide substrate (formula 2). The 
compound of formula 1 containing a glutaryl spacer 
and a N-hydroxysuccinimide-linker appendage is the 
form used to couple the hapten (formula 1) to 
protein carriers KLH and BSA, while the compound of 
formula 3 is the inhibitor. The phosphonamidate 
functionality is a mimic of the stereo-electronic 
features of the transition state for hydrolysis of 
the amide bond. 

Figure 14 PCR amplification of Fd and 
kappa regions from the spleen mRNA of a mouse 
immunized with NPN is illustrated. Amplification 
was performed as described in Example 18 using RNA 
cDNA hybrids obtained by the reverse transcription 
of the mRNA with primer specific for amplification 
of light chain sequences (Table 2) or heavy chain 
sequences (Table 1) . Lanes F1-F8 represent the 
product of heavy chain amplification reactions with 
one of each of the eight 5 1 primers (primers 2-9, 
Table 1) and the unique 3 1 primer (primer 15 , Table 
2). Light chain (k) amplifications with the 5' 
primers (primers 3-6, and 12, respectively, Table 
2) and the appropriate 3* primer (primer 13, Table 
2) are shown in lanes F9-F13. A band of 700 bps is 
seen in all lanes indicating the successful 
amplification of Fd and k regions. 

Figure 15 The screening of phage 
libraries for antigen binding is depicted according 
to Example 18C. Duplicate plaque lifts of Fab 
(filters A,B), heavy chain (filters E,F) and light 
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chain (filters G,H) expression libraries were 
screened against 125 I-labelled BSA conjugated with 
NPN at a density of approximately 30,000 plaques 
per plate. Filters C and D illustrate the 
duplicate secondary screening of a cored positive 
from a primary filter A (arrows) as discussed in 
the text. 

Screening employed standard plaque lift 
methods. XL1 Blue cells infected with phage were 
incubated on 150mm plates for 4h at 37°, protein 
expression induced by overlay with nitrocellulose 
filters soaked in lOmM isopropyl thiogalactoside 
(IPTG) and the plates incubated at 25° for 8h. 
Duplicate filters were obtained during a second 
incubation employing the same conditions. Filters 
were then blocked in a solution of 1% BSA in PBS 
for lh before incubation with rocking at 25° for lh 
with a solution of 125 I-labellQd BSA conjugated to 
NPN (2 x 10 6 cpm ml" 1 ; BSA concentration at 0*1 M; 
approximately 15 NPN per BSA molecule) in 1% 
BSA/ PBS. Background was reduced by pre- 
centrifugation of stock radiolabeled BSA solution 
at 100,000 g for 15 min and pre-incubation of 
solutions with plaque lifts from plates containing 
bacteria infected with a phage having no insert. 
After labeling, filters were washed repeatedly with 
PBS/0.05% Tween 20 before development of 
autoradiographs overnight. 

Figure 16 The specificity of antigen 
binding as shown by competitive inhibition is 
illustrated according to Example 18C. Filter lifts 
from positive plaques were exposed to 125 I-BSA-NPN 
in the presence of increasing concentrations of the 
inhibitor NPN. 

In this study a number of phages 



13 

correlated with NPN binding as in Figure 15 were 
spotted (about 100 particles per spot) directly 
onto a bacterial lawns. The plate was then 
overlaid with an IPTG-soaked filter and incubated 
for 19h at 25°. The filter were then blocked in 1% 
BSA in PBS prior to incubation in 125 I-BSA-NPN as 
described previously in Figure 15 except with the 
inclusion of varying amounts of NPN in the labeling 
solution. Other conditions and procedures were as 
in Figure 15. The results for a phage of moderate 
affinity are shown in duplicate in the figure. 
Similar results were obtained for four other phages 
with some differences in the effective inhibitor 
concentration ranges. 

Figure 17 The characterization of an 
antigen binding protein is illustrated according to 
Example 18D. The concentrated partially purified 
bacterial supernate of an NPN-binding clone was 
separated by gel filtration and aliquot s from each 
fraction applied to microtitre plates coated with 

BSA-NPN. Addition of either anti-decapeptide ( ) 

or anti-kappa chain ( ) antibodies conjugated 

with alkaline phosphatase was followed by color 
development. The arrow indicates the position of 
elution of a known Fab fragment. The results show 
that antigen binding is a property of 50 kD protein 
containing both heavy and light chains. 

Single plaques of two NPN-positive clones 
(Figure 15) were picked and the plasmid containing 
the heavy and light chain inserts excised (19) . 
500 ml cultures in L-broth were inoculated with 3 
ml of a saturated culture containing the excised 
plasmids and incubated for 4h at 37 . Proteins 
synthesis was induced by the addition of IPTG to a 
final concentration of ImM and the cultures 
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incubated for lOh at 25°. 200 ml of cells supernate 
were concentrated to 2 ml and applied to a TSK- 
G4000 column. 50 /xl aliquots from the eluted 
fractions were assayed by ELISA. 
5 For ELISA analysis, microtitre plates 

were coated with BSA-NPN at 1 ug/ml, 50 /xl samples 
mixed with 50 /xl PBS-Tween 20 (0.05%)-BSA (0.1%) 
added and the plates incubated for 2h at 25°. 
After washing with PBS-Tween 20-BSA, 50 /xl of 

10 appropriate concentrations of a rabbit anti- 

decapeptide antibody (20) and a goat anti-mouse 
kappa light chain (Southern Biotech) antibody 
conjugated with alkaline phosphatase were added and 
incubated for 2h at 25°. After further washing, 50 

15 /xl of p-nitrophenyl phosphate (1 mg/ml in 0.1M tris 
pH 9.5 containing 50 mM MgC12) were added and the 
plates incubated for 15-30 min before reading the 
OD at 405 nm. 

Figure 18 The sequence of the synthetic 

20 DNA insert inserted into Lambda Zap II V H to produce 
the selectable V H expression vector (panel A) and 
Lambda Zap II V L II according to Example 17 to 
produce the selectable V L expression vector (panel 
B). 

25 

figure J.9 

(A) The major features of the selectable 
V L expression vector are shown in panel A. The 
feature of the synthetic DNA sequence from Figure 

30 18 A is shown at the top along with the T 3 polymerase 
promoter from Lambda Zap II. The orientation of 
the insert in Lambda Zap II is shown. The V H DNA 
homologs are inserted into the Xho I and Spe I 
restriction enzyme sites. The V H DNA homologs are 

35 inserted into the Xho I and Spe I site and the read 
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through transcription produces the decapeptide 
epitope (tag) that is located just 3 1 of the 
cloning sites, 

(B) The major features of the bacterial 
5 expression vector Lambda Zap II V H (V H -expression 

vector) are shown, the synthetic DNA sequence from 
Figure 6 is shown at the top along with the T 3 
polymerase promoter from Lambda Zap II. The 
orientation of the insert in Lambda Zap II is 

10 shown. The V H DNA homologs are inserted into the 
Xho I and Spe I restriction enzyme sites. The V H 
DNA are inserted into the Xho I and Spe I site and 
the read through transcription produces the 
decapeptide epitope (tag) that is located just 3* 

15 of the cloning sites. 

Figure 20 One of the vectors for 
expression V H and V L in combination are shown. The 
various essential components of these vectors are 
shown. The selectable marker (sup F) is shown. 

20 

Detailed Description of the Invention 
A. Definitions 

Nucleotide ; a monomer ic unit of DNA 
or RNA consisting of a sugar moiety (pentose) , a 

25 phosphate, and a nitrogenous heterocyclic base. 
The base is linked to the sugar moiety via the 
glycosidic carbon (1* carbon of the pentose) and 
that combination of base and sugar is a nucleoside. 
When the nucleoside contains a phosphate group 

30 bonded to the 3' or 5' position of the pentose it 
is referred to as a nucleotide. 

Base Pair (bp) : a partnership of 
adenine (A) with thymine (T) , or of cytosine (C) 
with guanine (G) in a double stranded DNA molecule. 

35 In RNA, uracil (U) is substituted for thymine. 
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Nucleic Acid : a polymer of 
nucleotides, either single or double stranded. 

Gene ; a nucleic acid whose 
nucleotide sequence codes for a RNA or polypeptide. 
5 A gene can be either RNA or DNA. 

Complementary Bases : nucleotides 
that normally pair up when DNA or RNA adopts a 
double stranded configuration. 

Complementary Nucleotide Sequence : 
10 a sequence of nucleotides in a single-stranded 
molecule of DNA or RNA that is sufficiently 
complementary to that on another single strand to 
specifically hybridize to it with consequent 
hydrogen bonding. 
15 Conserved : a nucleotide sequence is 

conserved with respect to a preselected (reference) 
sequence if it non-randomly hybridizes to an exact 
complement of the preselected sequence. 

Hybridization : the pairing of 
20 substantially complementary nucleotide sequences 
(strands of nucleic acid) to form a duplex or 
heteroduplex by the establishment of hydrogen bonds 
between complementary base pairs. It is a 
specific, i.e. non-random, interaction between two 
25 complementary polynucleotide that can be 
competitively inhibited. 

Nucleotide Analog: a purine or 
pyrimidine nucleotide that differs structurally 
from a, T, G, C, or U, but is sufficiently similar 
30 to substitute for the normal nucleotide in a 
nucleic acid molecule. 

DNA Homoloa : is a nucleic acid 
having a preselected conserved nucleotide sequence 
and a sequence coding for a receptor capable of 
35 binding a preselected ligand. 
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Antibody : The term antibody in its 
various grammatical forms is used herein to refer 
to immunoglobulin molecules and immunologically 
active portions of immunoglobulin molecules, i.e., 
5 molecules that contain an antibody combining site 
or paratope. Exemplary antibody molecules are 
intact immunoglobulin molecules, substantially 
intact immunoglobulin molecules and portions of an 
immunoglobulin molecule, including those portions 
10 known in the art as Fab, Fab 1 , F(ab f ) 2 and F(v) . 

Antibody Combining Site : An 
antibody combining site is that structural portion 
of an antibody molecule comprised of a heavy and 
light chain variable and hypervariable regions that 
15 specifically binds (immunoreacts with) an antigen. 
The term immunoreact in its various forms means 
specific binding between an antigenic determinant- 
containing molecule and a molecule containing an 
antibody combining site such as a whole antibody 
20 molecule or a portion thereof. 

Monoclonal Antibody : The phrase 
monoclonal antibody in its various grammatical 
forms refers to a population of antibody molecules 
that contains only one species of antibody 
25 combining site capable of immunoreacting with a 
particular antigen. A monoclonal antibody thus 
typically displays a single binding affinity for 
any antigen with which it immunoreacts. A 
monoclonal antibody may therefore contain an 
30 antibody molecule having a plurality of antibody 
combining sites, each immunospecif ic for a 
* different antigen, e.g., a bispecific monoclonal 

antibody . 



35 
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The present invention contemplates a 
method of isolating from a repertoire of conserved 
genes a gene coding for a receptor having a 
preselected activity, preferably a catalytic 
5 activity- The receptor can be a polypeptide, an 
RNA molecule, such as a transfer RNA, an RNA 
displaying enzymatic activity, and the like. 
Preferably, the receptor will be a polypeptide 
capable of binding a ligand, such as an enzyme, 

10 antibody molecule or immunologically active portion 
thereof, cellular receptor, or cellular adhesion 
protein coded for by one of the members of a family 
of conserved genes, i.e., genes containing a 
conserved nucleotide sequence of at least about 10 

15 nucleotides in length. 

Exemplary conserved gene families are 
those coding for immunoglobulins, major 
histocompatibility complex antigens of class I or 
II, lymphocyte receptors, integrins and the like. 

20 A gene can be identified as belonging to 

a repertoire of conserved genes using several 
methods. For example, an isolated gene may be used 
as a hybridization probe under low stringency 
conditions to detect other members of the 

25 repertoire of conserved genes present, in genomic 
DNA using the methods described by Southern, J. 
Mol. Biol. . 98:503 (1975). If the gene used as a 
hybridization probe hybridizes to multiple 
restriction endonuclease fragments that gene is a 

30 member of a repertoire of conserved genes. 



Tmmunoqlobulins 

The immunoglobulins, or antibody 
molecules, are a large family of molecules that 
35 include several types of molecules, such as IgD, 
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IgG, IgA, IgM and IgE. The antibody molecule is 
typically comprised of two heavy (H) and light (L) 
chains with both a variable (V) and constant (C) 
region present on each chain. Several different 
5 regions of an immunoglobulin contain conserved 

sequences useful for isolating an immunoglobulin 
repertoire. Extensive amino acid and nucleic acid 
sequence data displaying exemplary conserved 
sequences is compiled for immunoglobulin molecules 

10 by Kabat et al . , in Sequences of Proteins of 

Immunological Interest , National Institutes of 
Health, Bethesda, MD, 1987. 

The C region of the H chain defines the 
particular immunoglobulin type. Therefore the 

15 selection of conserved sequences as defined herein 
from the C region of the H chain results in the 
preparation of a repertoire of immunoglobulin genes 
having members of the immunoglobulin type of the 
selected C region. 

20 The V region of the H or L chain 

typically comprises four framework (FR) regions 
each containing relatively lower degrees of 
variability that includes lengths of conserved 
sequences. The use of conserved sequences from the 

25 FR1 and FR4 (J region ) framework regions of the V H 
chain is a preferred exemplary embodiment and is 
described herein in the Examples. Framework 
regions are typically conserved across several or 
all immunoglobulin types and thus conserved 

30 sequences contained therein are particularly suited 
for preparing repertoires having several 
immunoglobulin types. 

Maior Histocompatibility Complex 
35 The major histocompatibility complex 
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(MHC) is a large genetic locus that encodes an 
extensive family of proteins that include several 
classes of molecules referred to as class I, class 
II or class III MHC molecules. Paul et al., in 
5 Fundamental Immunology . Raven Press, NY, pp. 303- 
378 (1984). 

Class I MHC molecules are a polymorphic 
group of transplantation antigens representing a 
conserved family in which the antigen is comprised 
10 of a heavy chain and a non-MHC encoded light chain. 
The heavy chain includes several regions, termed 
the N, CI, C2, membrane and cytoplasmic regions. 
Conserved sequences useful in the present invention 
are found primarily in the N, CI and C2 regions and 
15 are identified as continuous sequences of 

"invariant residues" in Kabat et al., supra . 

Class II MHC molecules comprise a 
conserved family of polymorphic antigens that 
participate in immune responsiveness and are 
20 comprised of an alpha and a beta chain. The genes 
coding for the alpha and beta chain each include 
several regions that contain conserved sequences 
suitable for producing MHC class II alpha or beta 
chain repertoires. Exemplary conserved nucleotide 
25 sequences include those coding for amino acid 

residues 26-30 of the Al region, residues 161-170 
of the A2 region and residues 195-206 of the 
membrane region, all of the alpha chain. Conserved 
sequences are also present in the Bl, B2 and 
30 membrane regions of the beta chain at nucleotide 
sequences coding for amino acid residues 41-45, 
150-162 and 200-209, respectively. 

Lymphocyte Recep tors and Cell Surface Antigens 
35 Lymphocytes contain several families of 
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proteins on their cell surfaces including the T- 
cell receptor, Thy-1 antigen and numerous T-cell 
surface antigens including the antigens defined by 
the monoclonal antibodies 0KT4 (leu3), OKUT5/8 
5 (leu2), OKUT3 , OKUT1 (leul) , OKT 11 (leu5) OKT6 and 

OKT9. Paul, supra at pp. 458-479. 

The T-cell receptor is a term used for a 
family of antigen binding molecules found on the 
surface of T-cells. The T-cell receptor as a 

10 family exhibits polymorphic binding specificity 

similar to immunoglobulins in its diversity. The 
mature T-cell receptor is comprised of alpha and 
beta chains each having a variable (V) and constant 
(C) region. The similarities that the T-cell 

15 receptor has to immunoglobulins in genetic 

organization and function shows that T-cell 
receptor contains regions of conserved sequence. 
Lai et al., Nature . 331:543-546 (1988). 

Exemplary conserved sequences include 

20 those coding for amino acid residues 84-90 of alpha 
chain, amino acid residues 107-115 of beta chain, 
and amino acid residues 91-95 and 111-116 of the 
gamma chain. Kabat et al., supra . p. 279. 

25 

Inteqrins And Adhesions 

Adhesive proteins involved in cell 
attachment are members of a large family of related 
proteins termed integrins. Integrins are 

30 heterodimers comprised of a beta and an alpha 

subunit. Members of the integrin family include 
the cell surface glycoproteins platelet receptor 
GpIIb-IIIa, vitronectin, receptor (VnR) fibronectin 
receptor (FnR) and the leukocyte adhesion receptors 

35 LFA-1, Mac-1, Mo-1 and 60.3. Roushahti et al., 
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Science . 238:491-497 (1987). Nucleic acid and 
protein sequence data demonstrates regions of 
conserved sequences exist in the members of these 
families particularly between the beta chain of 
GpIIb-IIIa VnR and FnR, and between the alpha 
subunit of VnR, Mac-1, LFA-1, Fnr and GpIIb-IIIa. 
Suzuki et al., Proc. Natl. Acad. Sci. USA . 83:8614- 
8618, 1986; Ginsberg et al., J, Biol. Chem. , 
262:5437-5440, 1987. 

The following discussion illustrates the 
method of the present invention applied to 
isolating a conserved receptor-coding gene from the 
immunoglobulin gene repertoire. This discussion is 
not to be taken as limiting, but rather as 
illustrating application of principles that can be 
used to isolate a gene from any family of conserved 
genes coding for functionally related receptors. 

Generally, the method combines the 
following elements: 

1. Isolating nucleic acids containing a 
substantial portion of the immunological 
repertoire. 

2. Preparing polynucleotide primers for 
cloning polynucleotide segments containing 
immunoglobulin V H and/or V L region genes. 

3. Preparing a gene library containing 
a plurality of different V H and V L genes from the 
repertoire. 

4. Expressing the V H and/ or V L 
polypeptides in a suitable host, including 
prokaryotic and eukaryotic hosts, either separately 
or in the same cell, and either on the same or 
different expression vectors. 

5. Screening the expressed polypeptides 
for the preselected activity, and segregating a V H - 
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and/or V L -coding gene identified by the screening 
process . 

A receptor produced by the present 
invention assumes a conformation having a binding 
5 site specific for as evidenced by its ability to be 

competitively inhibited, a preselected or 
predetermined ligand such as an antigen, enzymatic 
substrate and the like. In one embodiment, a 
receptor of this invention is a ligand binding 

10 polypeptide that forms an antigen binding site 

which specifically binds to a preselected antigen 
to form a complex having a sufficiently strong 
binding between the antigen and the binding site 
for the complex to be isolated. When the receptor 

15 is an antigen binding polypeptide its affinity or 
avidity is generally greater than 10 5 - M" 1 more 
usually greater than 10 6 and preferably greater than 
10 8 M M . 

In another embodiment, a receptor of the 

20 subject invention binds a substrate and catalyzes 
the formation of a product from the substrate. 
While the topology of the ligand binding site of a 
catalytic receptor is probably more important for 
its preselected activity than its affinity 

25 (association constant or pKa) for the substrate, 

the subject catalytic receptors have an association 
constant for the preselected substrate generally 
greater than 10 3 M" 1 , more usually greater than 10 5 
M" 1 or 10 6 M" 1 and preferably greater than 10 7 M* 1 . 

30 Preferably the receptor produced by the 

subject invention is heterodimeric and is therefore 
normally comprised of two different polypeptide 
chains, which together assume a conformation having 
a binding affinity, or association constant for the 

35 preselected ligand that is different, preferably 
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higher, than the affinity or association constant 
of either of the polypeptides alone, i.e., as 
monomers. One or both of the different polypeptide 
chains is derived from the variable region of the 
light and heavy chains of an immunoglobulin. 
Typically, polypeptides comprising the light (V L ) 
and heavy (V H ) variable regions are employed 
together for binding the preselected ligand. 

A receptor produced by the subject 
invention can be active in monomeric as well as 
multimeric forms, either homomeric or heteromeric, 
preferably heterodimeric . For example, V H and V L 
ligand binding polypeptide produced by the present 
invention can be advantageously combined in the 
heterodimer to modulate the activity of either or 
to produce an activity unique to the heterodimer. 
The individual ligand binding polypeptides will be 
referred to as V H and V L and the heterodimer will be 
referred to as a Fv. 

However, it should be understood that a V H 
binding polypeptide may contain in addition to the 
V H , substantially all or a portion of the heavy 
chain constant region. A V L binding polypeptide may 
contain, in addition to the V L , substantially all or 
a portion of the light chain constant region. A 
heterodimer comprised of a V H binding polypeptide 
containing a portion of the heavy chain constant 
region and a V L binding containing substantially all 
of the light chain constant region is termed a Fab 
fragment. The production of Fab can be 
advantageous in some situations because the 
additional constant region sequences contained in a 
Fab as compared to a F v could stabilize the V H and 
V L interaction. Such stabilization could cause the 
Fab to have higher affinity for antigen. In 
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addition the Fab is more commonly used in the art 
and thus there are more commercial antibodies 
available to specifically recognize a Fab. 

The individual V H and V L polypeptides will 
generally have fewer than 125 amino acid residues, 
more usually fewer than about 120 amino acid 
residues, while normally having greater than 60 
amino acid residues, usually greater than about 95 
amino acid residues, more usually greater than 
about 100 amino acid residues. Preferably, the V H 
will be from about 110 to about 125 amino acid 
residues in length while V L will be from about 95 to 
about 115 amino acid residues in length. 

The amino acid residue sequences will 
vary widely, depending upon the particular idiotype 
involved. Usually, there will be at least two 
cysteines separated by from about 60 to 75 amino 
acid residues and joined by a disulfide bond. The 
polypeptides produced by the subject invention will 
normally be substantial copies of idiotypes of the 
variable regions of the heavy and/or light chains 
of immunoglobulins, but in some situations a 
polypeptide may contain random mutations in amino 
acid residue sequences in order to advantageously 
improve the desired activity. 

In some situations, it is desirable to 
provide for covalent cross linking of the V H and V L 
polypeptides, which can be accomplished by 
providing cysteine resides at the carboxyl termini. 
The polypeptide will normally be prepared free of 
the immunoglobulin constant regions, however a 
small portion of the J region may be included as a 
result of the advantageous selection of DNA 
synthesis primers. The D region will normally be 
included in the transcript of the V H . 
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In other situations, it is desirable to 
provide a peptide linker to connect the V L and the 
V H to form a single-chain antigen-binding protein 
comprised of a V H and a V L . This single-chain 
5 antigen-binding protein would be synthesized as. a 
single protein chain. Such single-chain antigen- 
binding proteins have been described by Bird et 
al., Science . 242:423-426 (1988). The design of 
suitable peptide linker regions is described in 

10 U.S. Patent No. 4 , 704 , 692 by Robert Landner. 

Such a peptide linker could be designed 
as part of the nucleic acid sequences contained in 
the expression vector. The nucleic acid sequences 
coding for the peptide linker would be between the 

15 V H and v L DNA homologs and the restriction 

endonuclease sites used to operatively link the V H 
an V L DNA homologs to the expression vector. 

Such a peptide linker could also be coded 
for nucleic acid sequences that are part of the 

20 polynucleotide primers used to prepare the various 
gene libraries. The nucleic acid sequence coding 
for the peptide linker can be made up of nucleic 
acids attached to one of the primers or the nucleic 
acid sequence coding for the peptide linker may be 

25 derived from nucleic acid sequences that are 

attached to several polynucleotide primers used to 
create the gene libraries. 

Typically the C terminus region of the V H 
and V L polypeptides will have a greater variety of 

30 the sequences than the N terminus and f based on the 
present strategy, can be further modified to permit 
a variation of the normally occurring V H and V L 
chains. A synthetic polynucleotide can be employed 
to vary one or more amino in an hypervariable 

35 region. 
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1. Isolation Of The Repertoire 
To prepare a composition of nucleic acids 
containing a substantial portion of the 
5 immunological gene repertoire, a source of genes 
coding for the V H and/or V L polypeptides is 
required. Preferably the source will be a 
heterogeneous population of antibody producing 
cells, i.e. B lymphocytes (B cells), preferably 
10 rearranged B cells such as those found in the 

circulation or spleen of a vertebrate. (Rearranged 
B cells are those in which immunoglobulin gene 
translocation, i.e., rearrangement, has occurred as 
evidenced by the presence in the cell of mRNA with 
15 the immunoglobulin gene V, D and J region 
transcripts adjacently located thereon.) 

In some cases, it is desirable to bias 
the repertoire for a preselected activity, such as 
by using as a source of nucleic acid cells (source 
20 cells) from vertebrates in any one of various 

stages of age, health and immune response. For 
example, repeated immunization of a healthy animal 
prior to collecting rearranged B cells results in 
obtaining a repertoire enriched for genetic 
25 material producing a ligand binding polypeptide of 
high affinity. Conversely, collecting rearranged B 
cells from a healthy animal whose immune system has 
not been recently challenged results in producing a 
repertoire that is not biased towards the 
30 production of high affinity V H and/or V L 
polypeptides. 

It should be noted the greater the 
genetic heterogeneity of the population of cells 
for which the nucleic acids are obtained, the 
35 greater the diversity of the immunological 
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repertoire that will be made available for 
screening according to the method of the present 
invention. Thus, cells from different individuals, 
particularly those having an immunologically 
significant age difference, and cells from 
individuals of different strains, races or species 
can be advantageously combined to increase the 
heterogeneity of the repertoire. 

Thus, in one preferred embodiment, the 
source cells are obtained from a vertebrate, 
preferably a mammal, which has been immunized or 
partially immunized with an antigenic ligand 
(antigen) against which activity is sought, i.e., a 
preselected antigen. The immunization can be 
carried out conventionally. Antibody titer in the 
animal can be monitored to determine the stage of 
immunization desired, which stage corresponds to 
the amount of enrichment or biasing of the 
repertoire desired. Partially immunized animals 
typically receive only one immunization and cells 
are collected therefrom shortly after a response is 
detected. Fully immunized animals display a peak 
titer, which is achieved with one or more repeated 
injections of the antigen into the host mammal, 
normally at 2 to 3 week intervals. Usually three 
to five days after the last challenge, the spleen 
is removed and the genetic repertoire of the 
spleenocytes, about 90% of which are rearranged B 
cells, is isolated using standard procedures. See, 
Current Protocols in Molecular Biology . Ausubel et 
al., eds., John Wiley & Sons, NY. Nucleic acids 
coding for V H and V L polypeptides can be derived 
from cells producing IgA, IgD, IgE, IgG or IgM, 
most preferably from IgM and IgG, producing cells. 

Methods for preparing fragments of 
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genomic DNA from which immunoglobulin variable 
region genes can be cloned as a diverse population 
are well known in the art. See for example 
Herrmann et al., Methods In Enzvmol . , 152:180-183, 
(1987); Frischauf, Methods In Enzvmol. , 152:183-190 
(1987); Frischauf, Methods In Enzvmol - . 152:190-199 
(1987); and DiLella et al., Methods In Enzvmol.. 
152:199-212 (1987). (The teachings of the 
references cited herein are hereby incorporated by 
reference. ) 

The desired gene repertoire can be 
isolated from either genomic material containing 
the gene expressing the variable region or the 
messenger RNA (mRNA) which represents a transcript 
of the variable region. The difficulty in using 
the genomic DNA from other than non-rearranged B 
lymphocytes is in juxtaposing the sequences coding 
for the variable region, where the sequences are 
separated by introns. The DNA fragment (s) 
containing the proper exons must be isolated, the 
introns excised, and the exons then spliced in the 
proper order and in the proper orientation. For 
the most part, this will be difficult, so that the 
alternative technique employing rearranged B cells 
will be the method of choice because the C D and J 
immunoglobulin gene regions have translocated to 
become adjacent, so that the sequence is continuous 
(free of introns) for the entire variable regions. 

Where mRNA is utilized the cells will be 
lysed under RNase inhibiting conditions. In one 
embodiment, the first step is to isolate the total 
cellular mRNA by hybridization to an oligo-dT 
cellulose column. The presence of mRNAs coding for 
the heavy and/or light chain polypeptides can then 
be assayed by hybridization with DNA single strands 
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of the appropriate genes. Conveniently, the 
sequences coding for the constant portion of the V H 
and V L can be used as polynucleotide probes, which 
sequences can be obtained from available sources. 
See for example, Early and Hood, Genetic 
Engineering , Setlow and Hollaender, eds., Vol, 3, 
Plenum Publishing Corporation, NY, (1981) , pages 
157-188; and Kabat et al., Sequences of 
Immunological Interest . National Institutes of 
Health, Bethesda, MD, (1987). In preferred 

embodiments, the preparation containing the total 
cellular mRNA is first enriched for the presence of 
V H and/or V L coding mRNA. Enrichment is typically 
accomplished by subjecting the total mRNA 
preparation or partially purified mRNA product 
thereof to a primer extension reaction employing a 
polynucleotide synthesis primer of the present 
invention. 

2. preparation Of Polynucleotide 
Primers 

The term "polynucleotide" as used herein 
in reference to primers, probes and nucleic acid 
fragments or segments to be synthesized by primer 
extension is defined as a molecule comprised of two 
or more deoxyribonucleotides or ribonucleotides, 
preferably more than 3. Its exact size will depend 
on many factors, which in turn depends on the 
ultimate conditions of use. 

The term "primer" as used herein refers 
to a polynucleotide whether purified from a nucleic 
acid restriction digest or produced synthetically, 
which is capable of acting as a point of initiation 
of synthesis when placed under conditions in which 
synthesis of a primer extension product which is 
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complementary to a nucleic acid strand is induced, 
i.e., in the presence of nucleotides and an agent 
for polymerization such as DNA polymerase, reverse 
transcriptase and the like, and at a suitable 
temperature and pH. The primer is preferably 
single stranded for maximum efficiency, but may 
alternatively be double stranded. If double 
stranded, the primer is first treated to separate 
its strands before being used to prepare extension 
products. Preferably, the primer is a 
polydeoxyribonucleotide. The primer must be 
sufficiently long to prime the synthesis of 
extension products in the presence of the agents 
for polymerization. The exact lengths of the 
primers will depend on may factors, including 
temperature and the source of primer. For example, 
depending on the complexity of the target sequence, 
a polynucleotide primer typically contains 15 to 25 
or more nucleotides, although it can contain fewer 
nucleotides. Short primer molecules generally 
require cooler temperatures to form sufficiently 
stable hybrid complexes with template. 

The primers used herein are selected to 
be "substantially" complementary to the different 
strands of each specific sequence to be synthesized 
or amplified. This means that the primer must be 
sufficiently complementary to non-randomly 
hybridize with its respective template strand. 
Therefore, the primer sequence may not reflect the 
exact sequence of the template. For example, a 
non-complementary nucleotide fragment can be 
attached to the 5 f end of the primer, with the 
remainder of the primer sequence being 
substantially complementary to the strand. Such 
non-complementary fragments typically code for an 
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endonuclease restriction site. Alternatively, non- 
complementary bases or longer sequences can be 
interspersed into the primer, provided the primer 
sequence has sufficient complementarily with the 
sequence of the strand to be synthesized or 
amplified to non-randomly hybridize therewith and 
thereby form an extension product under 
polynucleotide synthesizing conditions. 

The polynucleotide primers can be 
prepared using any suitable method, such as, for 
example, the phosphotriester on phosphodiester 
methods see Narang et al., Meth. Enzymol. r 68:90, 
(1979); U.S. Patent No. 4,356,270; and Brown et 
al., Meth. Enzymol. . 68:109, (1979). 

The choice of a primer's nucleotide 
sequence depends on factors such as the distance on 
the nucleic acid from the region coding for the 
desired receptor, its hybridization site on the 
nucleic acid relative to any second primer to be 
used, the number of genes in the repertoire it is 
to hybridize to, and the like. 

For example, to produce V H -coding DNA 
homologs by primer extension, the nucleotide 
sequence of a primer is selected to hybridize with 
a plurality of immunoglobulin heavy chain genes at 
a site substantially adjacent to the V H -coding 
region so that a nucleotide sequence coding for a 
functional (capable of binding) polypeptide is 
obtained. To hybridize to a plurality of different 
V H -coding nucleic acid strands, the primer must be a 
substantial complement of a nucleotide sequence 
conserved among the different strands. Such sites 
include nucleotide sequences in the constant 
region, any of the variable region framework 
regions, preferably the third framework region, 
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leader region, promoter region, J region and the 
like. 

If the V H -coding and V L -coding DNA 
homologs are to be produced by polymerase chain 
5 reaction (PCR) amplification, two primers must be 
used for each coding strand of nucleic acid to be 
amplified. The first primer becomes part of the 
nonsense (minus or complementary) strand and 
hybridizes to a nucleotide sequence conserved among 

10 V H (plus) strands within the repertoire. To produce 
V H coding DNA homologs, first primers are therefore 
chosen to hybridize to (i.e. be complementary to) 
conserved regions within the J region, CHI region, 
hinge region, CH2 region, or CH3 region of 

15 immunoglobulin genes and the like. To produce a V L 
coding DNA homolog, first primers are chosen to 
hybridize with (i.e. be complementary to) a 
conserved region within the J region or constant 
region of immunoglobulin light chain genes and the 

20 like. Second primers become part of the coding 
(plus) strand and hybridize to a nucleotide 
sequence conserved among minus strands. To produce 
the V H -coding DNA homologs, second primers are 
therefore chosen to hybridize with a conserved 

25 nucleotide sequence at the 5' end of the V H -coding 
immunoglobulin gene such as in that area coding for 
the leader or first framework region. It should be 
noted that in the amplification of both V H - and V t - 
coding DNA homologs the conserved 5' nucleotide 

30 sequence of the second primer can be complementary 
to a sequence exogenously added using terminal 
deoxynucleotidyl transferase as described by Loh et 
al., Sci. Vol 243:217-220 (1989). One or both of 
the first and second primers can contain a 

35 nucleotide sequence defining an endonuclease 
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recognition site. The site can be heterologous to 
the immunoglobulin gene being amplified and 
typically appears at or near the 5 1 end of the 
primer. 

Primers of the present invention may also 
contain a DNA-dependent RNA polymerase promoter 
sequence or its complement. See for example, Krieg 
et al.. Nucleic Acids Research P 12:7057-70 (1984); 
Studier et al., J. Mol. Biol. . 189:113-130 (1986); 
and Molecular Cloning; A Laboratory Manual, Second 
Edition . Maniatis et al., eds., Cold Spring Harbor, 
NY (1989) . 

When a primer containing a DNA-dependent 
RNA polymerase promoter is used the primer is 
hybridized to the polynucleotide strand to be 
amplified and the second polynucleotide strand of 
the DNA-dependent RNA polymerase promoter is 
completed using an inducing agent such as E. coli P 
DNA polymerase I, or the Klenow fragment of E. coli 
DNA polymerase. The starting polynucleotide is 
amplified by alternating between the production of 
an RNA polynucleotide and DNA polynucleotide. 

Primers may also contain a template 
sequence or replication initiation site for a RNA- 
directed RNA polymerase. Typical RNA-directed RNA 
polymerase include the QB replicase described by 
Lizardi et al., Biotechnology . 6:1197-1202 (1988). 

RNA-directed polymerases produce large 
numbers of RNA strands from a small number of 
template RNA strands that contain a template 
sequence or replication initiation site. These 
polymerases typically give a one million-fold 
amplification of the template strand as has been 
described by Kramer et al., J. Mol, Biol. . 89:719- 
736 (1974). 



35 



3 . Preparing a Gene Library 
The strategy used for cloning, i.e., 
substantially reproducing, the V H and/or V L genes 
contained within the isolated repertoire will 
depend, as is well known in the art, on the type, 
complexity, and purity of the nucleic acids making 
up the repertoire. Other factors include whether 
or not the genes are to be amplified and/or 
mutagenized. 

In one strategy, the object is to clone 
the V H - and/or V L -coding genes from a repertoire 
comprised of polynucleotide coding strands, such as 
mRNA and/or the sense strand of genomic DNA. If 
the repertoire is in the form of double stranded 
genomic DNA, it is usually first denatured, 
typically by melting, into single strands. The 
repertoire is subjected to a first primary 
extension reaction by treating (contacting) the 
repertoire with a first polynucleotide synthesis 
primer having a preselected nucleotide sequence. 
The first primer is capable of initiating the first 
primer extension reaction by hybridizing to a 
nucleotide sequence, preferably at least about 10 
nucleotides in length and more preferably at least 
about 20 nucleotides in length, conserved within 
the repertoire. The first primer is sometimes 
referred to herein as the "sense primer" because it 
hybridizes to the coding or sense strand of a 
nucleic acid. In addition, the second primer is 
sometimes referred to herein as the "anti-sense 
primer" because it hybridizes to a non-coding or 
anti-sense strand of a nucleic acid, i.e., a strand 
complementary to a coding strand. 

The first primer extension is performed 
by mixing the first primer, preferably a 
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predetermined amount thereof, with the nucleic 
acids of the repertoire, preferably a predetermined 
amount thereof, to form a first primer extension 
reaction admixture. The admixture is maintained 
under polynucleotide synthesizing conditions for a 
time period, which is typically predetermined, 
sufficient for the formation of a first primer 
extension reaction product, thereby producing a 
plurality of different V H -coding DNA homolog 
complements. The complements are then subjected to 
a second primer extension reaction by treating them 
with a second polynucleotide synthesis primer 
having a preselected nucleotide sequence. The 
second primer is capable of initiating the second 
reaction by hybridizing to a nucleotide sequence, 
preferably at least about 10 nucleotides in length 
and more preferably at least about 20 nucleotides 
in length, conserved among a plurality of different 
V H -coding gene complements such as those, for 
example, produced by the first primer extension 
reaction* This is accomplished by mixing the 
second primer, preferably a predetermined amount 
thereof, with the complement nucleic acids, 
preferably a predetermined amount thereof, to form 
a second primer extension reaction admixture. The 
admixture is maintained under polynucleotide 
synthesizing conditions for a time period, which is 
typically predetermined, sufficient for the 
formation of a first primer extension reaction 
product, thereby producing a gene library 
containing a plurality of different V H -and/or V L - 
coding DNA homologs. 

A plurality of first primer and/ or a 
plurality of second primers can be used in each 
amplification, or an individual pair of first and 
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second primers can be used. In any case, the 
amplification products of amplifications using the 
same or different combinations of first and second 
primers can be combined to increase the diversity 
of the gene library. 

In another strategy, the object is to 
clone the V H - and/or V L -coding genes from a 
repertoire by providing a polynucleotide complement 
of the repertoire, such as the anti-sense strand of 
genomic dsDNA or the polynucleotide produced by 
subjecting mRNA to a reverse transcriptase 
reaction. Methods for producing such complements 
are well known in the art. The complement is 
subjected to a primer extension reaction similar to 
the above-described second primer extension 
reaction, i.e., a primer extension reaction using a 
polynucleotide synthesis primer capable of 
hybridizing to a nucleotide sequence conserved 
among a plurality of different V H -coding gene 
complements. 

The primer extension reaction is 
performed using any suitable method. Generally it 
occurs in a buffered aqueous solution, preferably 
at a pH of 7-9, most preferably about 8. 
Preferably, a molar excess (for genomic nucleic 
acid, usually about 10 6 :1 primer: template) of the 
primer is admixed to the buffer containing the 
template strand. A large molar excess is preferred 
to improve the efficiency of the process. 

The deoxy ribonucleotide triphosphates 
dATP, dCTP, dGTP, and dTTP are also admixed to the 
primer extension (polynucleotide synthesis) 
reaction admixture in adequate amounts and the 
resulting solution is heated to about 90C - 100C 
for about 1 to 10 minutes, preferably from 1 to 4 
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minutes. After this heating period the solution is 
allowed to cool to room temperature , which is 
preferable for primer hybridization. To the cooled 
mixture is added an appropriate agent for inducing 
or catalyzing the primer extension reaction, and 
the reaction is allowed to occur under conditions 
known in the art. The synthesis reaction may occur 
at from room temperature up to a temperature above 
which the inducing agent no longer functions 
efficiently. Thus, for example, if DNA polymerase 
is used as inducing agent, the temperature is 
generally no greater than about 40C. 

The inducing agent may be any compound or 
system which will function to accomplish the 
synthesis of primer extension products, including 
enzymes. Suitable enzymes for this purpose 
include, for example, E. coli . DNA polymerase I, 
Klenow fragment of E. coli DNA polymerase I, T4 DNA 
polymerase, other available DNA polymerases, 
reverse transcriptase, and other enzymes, including 
heat-stable enzymes, which will facilitate 
combination of the nucleotides in the proper manner 
to form the primer extension products which are 
complementary to each nucleic acid strand. 
Generally, the synthesis will be initiated at the 
3 1 end of each primer and proceed in the 5 1 
direction along the template strand, until 
synthesis terminates, producing molecules of 
different lengths. There may be inducing agents, 
however , which initiate synthesis at the 5" end and 
proceed in the above direction, using the same 
process as described above. 

The inducing agent also may be a compound 
or system which will function to accomplish the 
synthesis of RNA primer extension products, 
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including enzymes. In preferred embodiments, the 
inducing agent may be a DNA-dependent RNA 
polymerase such as T7 RNA polymerase, T3 RNA 
polymerase or SP6 RNA polymerase. These 
5 polymerases produce a complementary RNA 

polynucleotide. The high turn over rate of the RNA 
polymerase amplifies the starting polynucleotide as 
has been described by Chamberlin et al., The 
Enzymes , ed. P. Boyer, PP. 87-108, Academic Press, 

10 New York (1982), Another advantage of T7 RNA 

polymerase is that mutations can be introduced into 
the polynucleotide synthesis by replacing a portion 
of cDNA with one or more mutagenic 
oligodeoxynucleotides (polynucleotides) and 

15 transcribing the partially-mismatched template 

directly as has been previously described by Joyce 
et al., Nucleic Acid Research , 17:711-722 (1989). 
Amplification systems based on transcription have 
been described by Gingeras et al . , in PCR 

20 Protocols. A Guide to Methods and Applications, pp 
245-252, Academic Press, Inc., San Diego, CA 
(1990) . 

If the inducing agent is a DNA-dependent 
RNA polymerase and therefore incorporates 
25 ribonucleotide triphosphates, sufficient amounts of 
ATP, CTP, GTP and UTP are admixed to the primer 
extension reaction admixture and the resulting 
solution is treated as described above. 

The newly synthesized strand and its 
30 complementary nucleic acid strand form a double- 
stranded molecule which can be used in the 
succeeding steps of the process. 

The first and/or second primer extension 
reaction discussed above can advantageously be used 
35 to incorporate into the receptor a preselected 
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epitope useful in immunologically detecting and/or 
isolating a receptor. This is accomplished by 
utilizing a first and/or second polynucleotide 
synthesis primer or expression vector to 
incorporate a predetermined amino acid residue 
sequence into the amino acid residue sequence of 
the receptor. 

After producing V H - and/or V L -coding DNA 
homologs for a plurality of different V H - and/or V L - 
coding genes within the repertoire, the homologs 
are typically amplified. While the V H and/or V L - 
coding DNA homologs can be amplified by classic 
techniques such as incorporation into an 
autonomously replicating vector, it is preferred to 
first amplify the DNA homologs by subjecting them 
to a polymerase chain reaction (PCR) prior to 
inserting them into a vector. In fact, in 
preferred strategies, the first and/or second 
primer extension reactions used to produce the gene 
library are the first and second primer extension 
reactions in a polymerase chain reaction. 

PCR is typically carried out by cycling 
i.e., simultaneously performing in one admixture, 
the above described first and second primer 
extension reactions, each cycle comprising 
polynucleotide synthesis followed by denaturation 
of the double stranded polynucleotides formed. 
Methods and systems for amplifying a DNA homolog 
are described in U.S. Patents No. 4,683,195 and 
No. 4,683,202, both to Mullis et al. 

In preferred embodiments only one pair of 
first and second primers is used per amplification 
reaction. The amplification reaction products 
obtained from a plurality of different 
amplifications, each using a plurality of different 
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primer pairs, are then combined. 

However, the present invention also 
contemplates DNA homolog production via co- 
amplification (using two pairs of primers) , and 
multiplex amplification (using up to about 8, 9 or 
10 primer pairs) . 

The V H - and V L -coding DNA homologs 
produced by PCR amplification are typically in 
double-stranded form and have contiguous or 
adjacent to each of their termini a nucleotide 
sequence defining an endonuclease restriction site. 
Digestion of the V H - and V L -coding DNA homologs 
having restriction sites at or near their termini 
with one or more appropriate endonucleases results 
in the production of homologs having cohesive 
termini of predetermined specificity. 

In preferred embodiments, the PCR process 
is used not only to amplify the V H - and/or V L -coding 
DNA homologs of the library, but also to induce 
mutations within the library and thereby provide a 
library having a greater heterogeneity. First, it 
should be noted that the PCR processes itself is 
inherently mutagenic due to a variety of factors 
well known in the art. Second, in addition to the 
mutation inducing variations described in the above 
referenced U.S. Patent No. 4,683,195, other 
mutation inducing PCR variations can be employed. 
For example, the PCR reaction admixture, i.e., the 
combined first and second primer extension reaction 
admixtures, can be formed with different amounts of 
one or more of the nucleotides to be incorporated 
into the extension product. Under such 
conditions, the PCR reaction proceeds to produce 
nucleotide substitutions within the extension 
product as a result of the scarcity of a particular 
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base. Similarly, approximately equal molar amounts 
of the nucleotides can be incorporated into the 
initial PGR reaction admixture in an amount to 
efficiently perform X number of cycles, and then 
cycling the admixture through a number of cycles in 
excess of X, such as, for instance, 2X. 
Alternatively, mutations can be induced during the 
PGR reaction by incorporating into the reaction 
admixture nucleotide derivatives such as inosine, 
not normally found in the nucleic acids of the 
repertoire being amplified. During subsequent in 
vivo amplification, the nucleotide derivative will 
be replaced with a substitute nucleotide thereby 
inducing a point mutation. 

4. Expressing the V M and/or V, DNA 
Homoloqs. 

The V H . and/or V L -coding DNA homologs 
contained within the library produced by the above- 
described method can be operatively linked to a 
vector for amplification and/or expression. 

As used herein, the term "vector" refers 
to a nucleic acid molecule capable of transporting 
between different genetic environments another 
nucleic acid to which it has been operatively 
linked. One type of preferred vector is an 
episome, i.e., a nucleic acid molecule capable of 
extra-chromosomal replication. Preferred vectors 
are those capable of autonomous replication and/or 
expression of nucleic acids to which they are 
linked. Vectors capable of directing the 
expression of genes to which they are operatively 
linked are referred to herein as "expression 
vectors" . 

The choice of vector to which a V H - and/or 
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V L -coding DNA homolog is operatively linked depends 
directly, as is well known in the art, on the 
functional properties desired, e.g., replication or 
protein expression, and the host cell to be 
transformed, these being limitations inherent in 
the art of constructing recombinant DNA molecules. 

In preferred embodiments, the vector 
utilized includes a prokaryotic replicon i.e., a 
DNA sequence having the ability to direct 
autonomous replication and maintenance of the 
recombinant DNA molecule extra chromosomally in a 
prokaryotic host cell, such as a bacterial host 
cell, transformed therewith. Such replicons are 
well known in the art. In addition, those 
embodiments that include a prokaryotic replicon 
also include a gene whose expression confers a 
selective advantage, such as drug resistance, to a 
bacterial host transformed therewith. Typical 
bacterial drug resistance genes are those that 
confer resistance to ampicillin or tetracycline. 

Those vectors that include a prokaryotic 
replicon can also include a prokaryotic promoter 
capable of directing the expression (transcription 
and translation) of the V H - and/or V L -coding 
homologs in a bacterial host cell, such as E. coli 
transformed therewith. A promoter is an expression 
control element formed by a DNA sequence that 
permits binding of RNA polymerase and transcription 
to occur. Promoter sequences compatible with 
bacterial hosts are typically provided in plasmid 
vectors containing convenience restriction sites 
for insertion of a DNA segment of the present 
invention. Typical of such vector plasmids are 
pUC8, pUC9, pBR322, and pBR329 available from 
BioRad Laboratories, (Richmond , CA) and pPL and 
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pKK223 available from Pharmacia, (Piscataway, NJ) . 

Expression vectors compatible with 
eukaryotic cells, preferably those compatible with 
vertebrate cells, can also be used. Eukaryotic 
cell expression vectors are well known in the art 
and are available from several commercial sources • 
Typically, such vectors are provided containing 
convenient restriction sites for insertion of the 
desired DNA homolog. Typical of such vectors are 
pSVL and pKSV-10 (Pharmacia) , pBPV-l/PML2d 
(International Biotechnologies, Inc.)/ and pTDTl 
(ATCC, No. 31255) . 

In preferred embodiments, the eukaryotic 
cell expression vectors used include a selection 
marker that is effective in an eukaryotic cell, 
preferably a drug resistant selection marker. A 
preferred drug resistance marker is the gene whose 
expression results in neomycin resistance, i.e., 
the neomycin phosphotransferase (neo) gene. 
Southern et al., J. Mol. ApdI. Genet. . 1:327-341 
(1982). 

The use of retroviral expression vectors 
to express the genes of the V H and/or V L -coding DNA 
homologs is also contemplated. As used herein, the 
term "retroviral expression vector" refers to a DNA 
molecule that includes a promoter sequences derived 
from the long terminal repeat (LTR) region of a 
retrovirus genome. 

In preferred embodiments, the expression 
vector is typically a retroviral expression vector 
that is preferably replication-incompetent in 
eukaryotic cells. The construction and use of 
retroviral vectors has been described by Sorge et 
al., Mol. Cel. Biol. , 4:1730-1737 (1984). 

A variety of methods have been developed 
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to operatively link DNA to vectors via 
complementary cohesive termini • For instance, 
complementary cohesive termini can be engineered 
into the V„- and/or V L -coding DNA homologs during 
the primer extension reaction by use of an 
appropriately designed polynucleotide synthesis 
primer, as previously discussed. The vector, and 
DNA homolog if necessary, is cleaved with a 
restriction endonuclease to produce termini 
complementary to those of the DNA homolog. The 
complementary cohesive termini of the vector and 
the DNA homolog are then operatively linked 
(ligated) to produce a unitary double stranded DNA 
molecule. 

In preferred embodiments, the V H -coding 
and V L -coding DNA homologs of diverse libraries are 
randomly combined in vitro for polycistronic 
expression from individual vectors. That is, a 
diverse population of double stranded DNA 
expression vectors is produced wherein each vector 
expresses, under the control of a single promoter, 
one V H -coding DNA homolog and one V L -coding DNA 
homolog, the diversity of the population being the 
result of different V H - and V L -coding DNA homolog 
combinations. Random combination in vitro can be 
accomplished using two expression vectors 
distinguished from one another by the location on 
each of a restriction site common to both. 
Preferably the vectors are linear double stranded 
DNA, such as a Lambda Zap derived vector as 
described herein. In the first vector, the site is 
located between a promoter and a polylinker, i.e., 
5 1 terminal (upstream relative to the direction of 
expression) to the polylinker but 3* terminal 
(downstream relative to the direction of 
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expression) • In the second vector, the polylinker 
is located between a promoter and the restriction 
site, i.e., the restriction site is located 3* 
terminal to the polylinker, and the polylinker is 
5 located 3' terminal to the promoter. 

In preferred embodiments, each of the 
vectors defines a nucleotide sequence coding for a 
ribosome binding and a leader, the sequence being 
located between the promoter and the polylinker, 

10 but downstream (3 1 terminal) from the shared 
restriction site if that site is between the 
promoter and polylinker. Also preferred are 
vectors containing a stop codon downstream from the 
polylinker, but upstream from any shared 

15 restriction site if that site is downstream from 

the polylinker. The first and/or second vector can 
also define a nucleotide sequence coding for a 
peptide tag. The tag sequence is typically located 
downstream from the polylinker but upstream from 

20 any stop codon that may be present. In preferred 

embodiments, the vectors contain selectable markers 
such that the presence of a portion of that vector, 
i.e. a particular lambda arm, can be selected for 
or selected against. Typical selectable markers 

25 are well known to those skilled in the art. 

Examples of such markers are antibiotic resistance 
genes, genetically selectable markers, mutation 
suppressors such as amber suppressors and the like. 
The selectable markers are typically located 

30 upstream of the promoter and/or downstream of the 
second restriction site. In preferred embodiments, 
one selectable marker is located upstream of the 
promoter on the first vector containing the V H - 
coding DNA homologs. A second selectable marker is 

35 located downstream of the second restriction site 
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on the vector containing the V L -coding DNA homologs. 
This second selectable marker may be the same or 
different from the first as long as when the V H - 
coding vectors and the V L -coding vectors are 
5 randomly combined via the first restriction site 

the resulting vectors containing both V H and V L and 
both selectable markers can be selected. 

Typically the polylinker is a nucleotide 
sequence that defines one or more, preferably at 
10 least two, restriction sites, each unique to the 
vector and preferably not shared by the other 
vector, i.e., if it is on the first vector, it is 
not on the second vector. The polylinker 
restriction sites are oriented to permit ligation 
15 of V H - or V L -coding DNA homologs into the vector in 
same reading frame as any leader, tag or stop codon 
sequence present. 

Random combination is accomplished by 
ligating V H -coding DNA homologs into the first 
20 vector, typically at a restriction site or sites 
within the polylinker. Similarly, V L -coding DNA 
homologs are ligated into the second vector, 
thereby creating two diverse populations of 
expression vectors. It does not matter which type 
25 of DNA homolog, i.e., V„ or V L , is ligated to which 
vector, but it is preferred, for example, that all 
V H -coding DNA homologs are ligated to either the 
first or second vector, and all of the V L -coding DNA 
homologs are ligated to the other of the first or 
30 second vector. The members of both populations are 
then cleaved with an endonuclease at the shared 
restriction site, typically by digesting both 
populations with the same enzyme. The resulting 
product is two diverse populations of restriction 
35 fragments where the members of one have cohesive 
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termini complementary to the cohesive termini of 
the members of the other. The restriction 
fragments of the two populations are randomly 
ligated to one another, i.e., a random, 
interpopulation ligation is performed, to produce a 
diverse population of vectors each having a V H - 
coding and V L -coding DNA homolog located in the same 
reading frame and under the control of second 
vector's promoter. Of course, subsequent 
recombinations can be effected through cleavage at 
the shared restriction site, which is typically 
reformed upon ligation of members from the two 
populations, followed by subsequent religations. 

The resulting construct is then 
introduced into an appropriate host to provide 
amplification and/or expression of the V H - and/or 
V L -coding DNA homologs, either separately or in 
combination. When coexpressed within the same 
organism, either on the same or the different 
vectors, a functionally active Fv is produced. 
When the V H and V L polypeptides are expressed in 
different organisms, the respective polypeptides 
are isolated and then combined in an appropriate 
medium to form a Fv. Cellular hosts into which a 
V H - and/or V L -coding DNA homolog-containing 
construct has been introduced are referred to 
herein as having been "transformed" or as 
" trans formants 11 . 

The host cell can be either prokaryotic 
or eukaryotic. Bacterial cells are preferred 
prokaryotic host cells and typically are a strain 
of E. coli such as, for example, the E. coli strain 
DH5 available from Bethesda Research Laboratories, 
Inc., Bethesda, MD. Preferred eukaryotic host 
cells include yeast and mammalian cells, preferably 
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vertebrate cells such as those from a mouse, rat, 
monkey or human cell line. 

Transformation of appropriate cell hosts 
with a recombinant DNA molecule of the present 
5 invention is accomplished by methods that typically 
depend on the type of vector used. With regard to 
transformation of prokaryotic host cells, see, for 
example, Cohen et al., Proc. Natl. Acad. Sci. , USA, 
69:2110 (1972); and Maniatis et al., Molecular 

10 Cloning; A Laboratory Manual . Cold Spring Harbor, 
NY (1982) . With regard to the transformation of 
vertebrate cells with retroviral vectors containing 
rDNAs , see for example, Sorge et al., Mol. Cell. 
Biol. , 4:1730-1737 (1984); Graham et al., Virol. , 

15 52:456 (1973); and Wigler et al., Proc. Natl. Acad. 
Sci. . USA, 76:1373-1376 (1979). 

5. Screening For Expression of V H and/or 
V L Polypeptides 

20 Successfully transformed cells, i.e., 

cells containing a V H - and/or V L -coding DNA homolog 
operatively linked to a vector, can be identified 
by any suitable well known technique for detecting 
the binding of a receptor to a ligand or the 

25 presence of a polynucleotide coding for the 

receptor, preferably its active site. Preferred 
screening assays are those where the binding of 
ligand by the receptor produces a detectable 
signal, either directly or indirectly. Such 

30 signals include, for example, the production of a 
complex, formation of a catalytic reaction product, 
the release or uptake of energy, and the like. For 
example, cells from a population subjected to 
transformation with a subject rDNA can be cloned to 

35 produce monoclonal colonies. Cells form those 
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colonies can be harvested, lysed and their DNA 
content examined for the presence of the rDNA using 
a method such as that described by Southern, J, 
Mol. Biol.. 98:503 (1975) or Berent et al., 
Biotech . . 3;208 (1985). 

In addition to directly assaying for the 
presence of a V H - and/ or V L -coding DNA homolog, 
successful transformation can be confirmed by well 
known immunological methods , especially when the V H 
and/or V L polypeptides produced contain a 
preselected epitope. For example, samples of cells 
suspected of being transformed are assayed for the 
presence of the preselected epitope using an 
antibody against the epitope. 

6. Vu- And/Or V t -Coding Gene Libraries 
The present invention contemplates a gene 
library, preferably produced by a primer extension 
reaction or combination of primer extension 
reactions as described herein, containing at least 
about 10 3 , preferably at least about 10 4 and more 
preferably at least about 10 5 different V H - and/or 
V L -coding DNA homologs. The homologs are preferably 
in an isolated form, that is, substantially free of 
materials such as, for example, primer extension 
reaction agents and/or substrates, genomic DNA 
segments, and the like. 

In preferred embodiments, a substantial 
portion of the homologs present in the library are 
operatively linked to a vector, preferably 
operatively linked for expression to an expression 
vector. 

Preferably , the homologs are present in a 
medium suitable for in vitro manipulation, such as 
water, water containing buffering salts, and the 
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like. The medium should be compatible with 
maintaining the biological activity of the 
homologs. In addition, the homologs should be 
present at a concentration sufficient to allow 
5 transformation of a host cell compatible therewith 
at reasonable frequencies. 

It is further preferred that the homologs 
be present in compatible host cells transformed 
therewith. 

10 

D. Expressio n Vectors 
The present invention also contemplates 
various expression vectors useful in performing, 
inter alia , the methods of the present invention. 
15 Each of the expression vectors is a novel 
derivative of Lambda Zap. 

1. Lambda Zap II 

Lambda Zap II is prepared by 
replacing the Lambda S gene of the vector Lambda 
20 Zap with the Lambda S gene from the Lambda gtlO 
vector, as described in Example 6. 

2. Lambda Zap II V u 

Lambda Zap II V„ is prepared by 
inserting the synthetic DNA sequences illustrated 

25 in Figure 6A into the above-described Lambda Zap II 
vector. The inserted nucleotide sequence 
advantageously provides a ribosome binding site 
(Shine-Dalgarno sequence) to permit proper 
imitation of mRNA translation into protein, and a 

30 leader sequence to efficiently direct the 
translated protein to the periplasm. The 
preparation of Lambda Zap II V„ is described in more 
detail in Example 9, and its features illustrated 
in Figures 6A and 7. 

35 3. Lambda Zap II V. 
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Lambda Zap II V L is prepared as 
described in Example 12 by inserting into Lambda 
Zap II the synthetic DNA sequence illustrated in 
Figure 6B. Important features of Lambda Zap II V L 
are illustrated in Figure 8. 

4. Lambda Zap II V L II 
Lambda Zap II V L II is prepared as 
described in Example 11 by inserting into Lambda 
Zap II the synthetic DNA sequence illustrated in 
Figure 10. 

The above-described vectors are 
compatible with E. coli hosts, i.e., they can 
express for secretion into the periplasm proteins 
coded for by genes to which they have been 
operatively linked for expression. 

Examples 

The following examples are intended to 
illustrate, but not limit, the scope of the 
invention. 

1. Polynucleotide Selection 
The nucleotide sequences encoding the 
immunoglobulin protein CDR's are highly variable. 
However, there are several regions of conserved 
sequences that flank the V H domains. For instance, 
contain substantially conserved nucleotide 
sequences, i.e., sequences that will hybridize to 
the same primer sequence. Therefore, 
polynucleotide synthesis (amplification) primers 
that hybridize to the conserved sequences and 
incorporate restriction sites into the DNA homolog 
produced that are suitable for operatively linking 
the synthesized DNA fragments to a vector were 
constructed. More specifically, the DNA homologs 
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were inserted into Lambda ZAP II vector (Stratagene 
Cloning System, La Jolla, CA) at the Xho I and EcoR 
I sites. For amplification of the V H domains, the 
3' primer (primer 12 in Table 1), was designed to 
be complementary to the mRNA in the J H region . In 
all cases, the 5' primers (primers 1-10, Table 1) 
were chosen to be complementary to the first strand 
cDNA in the conserved N-terminus region (antisense 
strand) • Initially amplification was performed 
with a mixture of 32 primers (primer 1, Table 1) 
that were degenerate at five positions. Hybridoma 
mRNA could be amplified with mixed primers, but 
initial attempts to amplify mRNA from spleen 
yielded variable results. Therefore, several 
alternatives to amplification using the mixed 5' 
primers were compared. 

The first alternative was to construct 
multiple unique primers, eight of which are shown 
in Table 1, corresponding to individual members of 
the mixed primer pool. The individual primers 2-9 
of Table 1 were constructed by incorporating either 
of the two possible nucleotides at three of the 
five degenerate positions. 

The second alternative was to construct a 
primer containing inosine (primer 10, Table 1) at 
four of the variable positions based on the 
published work of Takahashi, et al., Proc. Natl, 
Acad, Sci. (U.S.A.) , 82:1931-1935, (1985) and 
Ohtsuka et al., J. Biol. Chem. . 260: 2605-2608, 
(1985) . This primer has the advantage that it is 
not degenerate and, at the same time minimizes the 
negative effects of mismatches at the unconserved 
positions as discussed by Martin et al . , Nuc. Acids 
Res. . 13:8927 (1985). However, it was not known if 
the presence of inosine nucleotides would result in 
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incorporation of unwanted sequences in the cloned V H 
regions. Therefore, inosine was not included at 
the one position that remains in the amplified 
fragments after the cleavage of the restriction 
sites. As a result, inosine was not in the cloned 
insert. 

Additional V H amplification primers 
including the unique 3' primer were designed to be 
complementary to a portion of the first constant 
region domain of the gamma 1 heavy chain mRNA 
(primers 15 and 16 , Table 1). These primers will 
produce DNA homologs containing polynucleotides 
coding for amino acids from the V H and the first 
constant region domains of the heavy chain. These 
DNA homologs can therefore be used to produce Fab 
fragments rather than an F v . 

Additional unique 3' primers designed to 
hybridize to similar regions of another class of 
immunoglobulin heavy chain such as IgM, IgE and IgA 
are contemplated. Over 3 • primers that hybridize 
to a specific region of a specific class of CH^ 
constant region and are adapted for transferring 
the V H domains amplified using this primer to an 
expression vector capable of expressing those V H 
domains with a different class of heavy or light 
chain constant region is also contemplated. 

As a control for amplification from 
spleen or hybridoma mRNA, a set of primers 
hybridizing to a highly conserved region within the 
constant region IgG, heavy chain gene were 
constructed. The 5 1 primer (primer 11, Table 1) is 
complementary to the cDNA in the C H 2 region whereas 
the 3 1 primer (primer 13, Table 1) is complementary 
to the mRNA in the C H 3 region. It is believed that 
no mismatches were present between these primers 
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and their templates. 

The nucleotide sequences encoding the V L 
CDRs are highly variable. However, there are 
several regions of conserved sequences that flank 
the V L CDR domains including the J L , V L framework 
regions and V L leader/promo tor. Therefore, 
amplification primers that hybridize to the 
conserved sequences and incorporate restriction 
sites that allowing cloning the amplified fragments 
into the pBluescript SK- vector cut with Nco I and 
Spe I were constructed. For amplification of the V L 
CDR domains, the 3' primer (primer 14 in Table 1), 
was designed to be complementary to the mRNA in the 
J L regions. The 5" primer (primer 15, Table 1) was 
chosen to be complementary to the first strand cDNA 
in the conserved N-terminus region (antisense 
strand) . 

A second set of amplification primers for 
amplification of the V L CDR domains the 5 1 primers 
(primers 1-8 in Table 2) were designed to be 
complementary to the first strand cDNA in the 
conserved N-terminus region. These primers also 
introduced a Sac I restriction endonuclease site to 
allow the V L DNA homolog to be cloned into the V L II- 
expression vector. The 3 1 V L amplification primer 
(primer 9 in Table 2) was designed to be 
complementary to the mRNA in the J L regions and to 
introduce the Xba I restriction endonuclease site 
required to insert the V L DNA homolog into the V L II- 
expression vector (Figure a) . 

Additional 3' V L amplification primers 
were designed to hybridize to the constant region 
of either kappa or lambda mRNA (primers 10 and 11 
in Table 2) . These primers allow a DNA homolog to 
be produced containing polynucleotide sequences 
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coding for constant region amino acids of either 
kappa or lambda chain. These primers make it 
possible to produce an Fab fragment rather than an 

Fy. 

The primers used for amplification of 
kappa light chain sequences for construction of 
Fabs are shown at least in Table 2. Amplification 
with these primers was performed in 5 separate 
reactions, each containing one of the 5' primers 
(primers 3-6, and 12) and one of the 3 1 primers 
(primer 13). The remaining 3 1 primer (primer 9) 
has been used to construct F v fragments. The 5' 
primers contain a Sac I restriction site and the 3 1 
primers contain a Xba I restriction site. 

The primers used for amplification of 
heavy chain Fd fragments for construction of Fabs 
are shown at least in Table 1. Amplification was 
performed in eight separate reactions, each 
containing one of the 5' primers (primers 2-9) and 
one of the 3' primers (primer 15). The remaining 
5 1 primers that have been used for amplification in 
a single reaction are either a degenerate primer 
(primer 1) or a primer that incorporates inosine at 
four degenerate positions (primer 10, Table 1, and 
primers 17 and 18, Table 2). The remaining 3' 
primer (primer 14, Table 2) has been used to 
construct F v fragments. Many of the 5' primers 
incorporate a Xho I site, and the 3' primers 
incorporate a Spe I restriction site. 

V H amplification primers designed to 
amplify human heavy chain variable regions are 
shown in Table 2. One of the 5 1 heavy chain primer 
contains inosine residues at degenerate nucleotide 
positions allowing a single primer to hybridize to 
a large number of variable region sequences. 
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Primers designed to hybridize to the constant 
region sequences of various IgG mRNAs are also 
shown in Table 2. 

V L amplification primers designed to 
5 amplify human light chain variable regions of both 
the lambda and kappa isotypes are also shown in 
Table 2. 

All primers and synthetic polynucleotides 
used herein and shown on Tables 1-4 were either 
10 purchased from Research Genetics in Huntsville, 

Alabama or synthesized on an Applied Biosystems DNA 
synthesizer, model 381A, using the manufacturer's 
instruction. 
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2 . Pr-nduction Qf A V" C odjpg Repertoire 

Rn ^-iched T " TTTC Bi^Hina Proteins 
Fluorescein isothiocyanate (FITC) was 
selected as a ligand for receptor binding. It was 
further decided to enrich by immunization the 
immunological gene repertoire, i.e., V H - and V t - 
coding gene repertoires, for genes coding for anti- 
FITC receptors. This was accomplished by linking 
FITC to keyhole limpet hemocyanin (KLH) using the 
techniques described in &r>ti bodies A laboratory 
manual . Harlow and Lowe, eds., Cold Spring Harbor, 
NY, (1988). Briefly, 10. 0 milligrams (mg) of 
keyhole limpet hemocyanin and 0.5 mg of FITC were 
added to 1 ml of buffer containing 0.1 M sodium 
carbonate at pH 9.6 and stirred for 18 to 24 hours 
at 4 degrees C (4C) . The unbound FITC was removed 
by gel filtration through Sephadex G-25. 

The KLH-FITC conjugate was prepared for 
injection into mice by adding 100 pg of the 
conjugate to 250 Ml of phosphate buffered saline 
(PBS). An equal volume of complete Freund's 
adjuvant was added and emulsified the entire 
solution for 5 minutes. A 129 G, x+ mouse was 
injected with 300 (jlI of the emulsion. Injections 
were given subcutaneously at several sites using a 
21 gauge needle. A second immunization with KLH- 
FITC was given two weeks later. This injection was 
prepared as follows: fifty ng of KLH-FITC were 
diluted in 250 fxL of PBS and an equal volume of 
alum was admixed to the KLH-FITC solution. The 
mouse was injected intraperitoneal^ with 500 nl of 
the solution using a 23 gauge needle. One month 
later the mice were given a final injection of 50 
fig of the KLH-FITC conjugate diluted to 200 /iL in 
PBS. This injection was given intravenously in the 
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lateral tail vein using a 30 gauge needle. Five 
days after This final injection the mice were 
sacrificed and total cellular RNA was isolated from 
their spleens. 

5 Hybridoma PCP 8D11 producing an antibody 

immunospecific for phosphonate ester was cultured 
in DMEM media (Gibco Laboratories, Grand Island, 
NY) containing 10 percent fetal calf serum 
supplemented with penicillin and streptomycin. 
10 About 5 x 10 8 hybridoma cells were harvested and 
washed twice in phosphate buffered saline. Total 
cellular RNA was prepared from these isolated 
hybridoma cells. 

3- Preparation nf a v„-n» dincr GgQg 
15 Repertoire 

Total cellular RNA was prepared from the 
spleen of a single mouse immunized with KLH-FITC as 
described in Example 2 using the RNA preparation 
methods described by Chomczynski et al., Anal 
20 Bjochem, , 162:156-159 (1987)using the 

manufacturer's instructions and the RNA isolation 
kit produced by Stratagene Cloning Systems, La 
Jolla, CA. Briefly, immediately after removing the 
spleen from the immunized mouse, the tissue was 
25 homogenized in 10 ml of a denaturing solution 

containing 4.0 M guanine isothiocyanate, 0.25 M 
sodium citrate at pH 7.0, and 0.1 M 2- 
mercaptoethanol using a glass homogenizer. One ml 
of sodium acetate at a concentration of 2 M at pH 
30 4.0 was admixed with the homogenized spleen. One 
ml of phenol that had been previously saturated 
with H 2 0 was also admixed to the denaturing solution 
containing the homogenized spleen. Two ml of a 
chloroform: isoamyl alcohol (24:1 v/v) mixture was 
added to this homogenate. The homogenate was mixed 
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vigorously for ten seconds and maintained on ice 
for 15 minutes. The homogenate was then 
transferred to a thick-walled 50 ml polypropylene 
centrifuged tube (Fisher Scientific Company, 
5 Pittsburg, PA) . The solution was centrifuged at 
10 , 000 x g for 20 minutes at 4C. The upper RNA- 
containing aqueous layer was transferred to a fresh 
50 ml polypropylene centrifuge tube and mixed with 
an equal volume of isopropyl alcohol. This 
10 solution was maintained at -20C for at least one 
hour to precipitate the RNA. The solution 
containing the precipitated RNA was centrifuged at 
10,000 x g for twenty minutes at 4C, The pelleted 
total cellular RNA was collected and dissolved in 3 

15 ml of the denaturing solution described above. 

Three ml of isopropyl alcohol was added to the re- 
suspended total cellular RNA and vigorously mixed. 
This solution was maintained at -20C for at least 1 
hour to precipitate the RNA. The solution 

20 containing the precipitated RNA was centrifuged at 
10,000 x g for ten minutes at 4C. The pelleted RNA 
was washed once with a solution containing 75% 
ethanol. The pelleted RNA was dried under vacuum 
for 15 minutes and then re-suspended in dimethyl 

25 pyrocarbonate (DEPC) treated (DEPC-H 2 0) H 2 0. 

Messenger RNA (mRNA) enriched for 
sequences containing long poly A tracts was 
prepared from the total cellular RNA using methods 
described in Molecular Cloning A Laboratory Manual , 

30 Maniatias et al., eds., Cold Spring Harbor, NY, 
(1982) . Briefly, one half of the total RNA 
isolated from a single immunized mouse spleen 
prepared as described above was re-suspended in one 
ml of DEPC-H 2 0 and maintained at 65C for five 

35 minutes. One ml of 2x high salt loading buffer 
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consisting of 100 mM Tris-HCl, 1 M sodium chloride, 
2.0 iM disodium ethylene diamine tetra-acetic acid 
(EDTA) at pH 7.5, and 0.2% sodium dodecyl sulfate 
(SDS) was added to the re-suspended RNA and the 
mixture allowed to cool to room temperature. The 
mixture was then applied to an oligo-dT 
(Collaborative Research Type 2 or Type 3) column 
that was previously prepared by washing the oligo- 
dT with a solution containing 0.1 M sodium 
hydroxide and 5 mM EDTA and then equilibrating the 
column with DEPC-H 2 0. The eluate was collected in a 
sterile polypropylene tube and reapplied to the 
same column after heating the eluate for 5 minutes 
at 65C. The oligo dT column was then washed with 2 
ml of high salt loading buffer consisting of 50 mM 
Tris-HCl at pH 7.5, 500 mM sodium chloride, 1 mM 
EDTA at pH 7.5 and 0.1% SDS. The oligo dT column 
was then washed with 2 ml of 1 X medium salt buffer 
consisting of 50 mM Tris-HCl at pH 7.5, 100 mM 
sodium chloride 1 mM EDTA and 0.1% SDS. The 
messenger RNA was eluted from the oligo dT column 
with 1 ml of buffer consisting of 10 mM Tris-HCl at 
pH 7.5, 1 mM EDTA at pH 7.5 and 0.05% SDS. The 
messenger RNA was purified by extracting this 
solution with phenol/chloroform followed by a 
single extraction with 100% chloroform. The 
messenger RNA was concentrated by ethanol 
precipitation and re-suspended in DEPC H 2 0. 

The messenger RNA isolated by the above 
process contains a plurality of different V H coding 
polynucleotides, i.e., greater than about 10 4 
different V H -coding genes. 

4. Preparation Of A Single V H Coding 

Polynucleotide 
Polynucleotides coding for a single V H 
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were isolated according to Example 3 except total 
cellular RNA was extracted from monoclonal 
hybridoma cells prepared in Example 2. The 
polynucleotides isolated in this manner code for a 
5 single V H . 

5. DNA Homoloa Preparation 
In preparation for PCR amplification, 
mRNA prepared according to the above examples was 
used as a template for cDNA synthesis by a primer 
10 extension reaction. In a typical 50 /il 

transcription reaction, 5-10 ug of spleen or 
hybridoma mRNA in water was first hybridized 
(annealed) with 500 ng (50.0 pmol) of the 3' V„ 
primer (primer 12, Table 1) , at 65C for five 
15 minutes. Subsequently, the mixture was adjusted to 
1.5 mM dATP, dCTP, dGTP and dTTP, 40 mM Tris-HCl at 
pH 8.0, 8 mM MgCl 2 , 50 mM NaCl, and 2 mM spermidine. 
Moloney-Murine Leukemia virus Reverse transcriptase 
(Stratagene Cloning Systems) , 26 units, was added 
20 and the solution was maintained for 1 hour at 37C. 

PCR amplification was performed in a 100 
(jlI reaction containing the products of the reverse 
transcription reaction (approximately 5 ug of the 
cDNA/RNA hybrid) , 300 ng of 3' V H primer (primer 12 
25 of Table 1), 300 ng each of the 5 1 V H primers 

(primer 2-10 of Table 1) 200 mM of a mixture of 
dNTP's, 50 mM KC1, 10 mM Tris-HCl pH 8.3, 15 mM 
MgCl 2 , 0.1% gelatin and 2 units of Taq DNA 
polymerase. The reaction mixture was overlaid with 
30 mineral oil and subjected to 40 cycles of 

amplification. Each amplification cycle involved 
denaturation at 92C for 1 minute, annealing at 52C 
for 2 minutes and polynucleotide synthesis by 
Primer extension (elongation) at 72C for 1.5 
35 minutes. The amplified V H -coding DNA homolog 
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containing samples were extracted twice with 
phenol/chloroform, once with chloroform, ethanol 
precipitated and were stored at -70C in 10 mM Tris- 
HC1, (pH, 7.5) and 1 mM EDTA. 

Using unique 5 1 primers (2-9, Table 1), 
efficient V H -coding DNA homolog synthesis and 
amplification from the spleen mRNA was achieved as 
shown in Figure 3, lanes R17-R24. The amplified 
cDNA (V H -coding DNA homolog) is seen as a major band 
of the expected size (360 bp) . The intensities of 
the amplified V H -coding polynucleotide fragment in 
each reaction appear to be similar, indicating that 
all of these primers are about equally efficient in 
initiating amplification. The yield and quality of 
the amplification with these primers was 
reproducible. 

The primer containing inosine also 
synthesized amplified V H -coding DNA homologs from 
spleen mRNA reproducibly, leading to the production 
of the expected sized fragment, of an intensity 
similar to that of the other amplified cDNAs 
(Figure 4, Lane R16) . This result indicated that 
the presence of inosine also permits efficient DNA 
homolog synthesis and amplification. Clearly 
indicating how useful such primers are in 
generating a plurality of V H -coding DNa homologs. 
Amplification products obtained from the constant 
region primers (primers 11 and 13, Table 1) were 
more intense indicating that amplification was more 
efficient, possibly because of a higher degree of 
homology between the template and primers (Figure 
4 , Lane R9 ) . Based on these results , a V H -coding 
gene library was constructed from the products of 
eight amplifications, each performed with a 
different 5" primer. Equal portions of the 
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products from each primer extension reaction were 
mixed and the mixed product was then used to 
generate a library of V H -coding DNA homolog- 
containing vectors. 
5 DNA homologs of the V L were prepared from 

the purified mRNA prepared as described above. In 
preparation for PCR amplification , mRNA prepared 
according to the above examples was used as a 
template for cDNA synthesis. In a typical 50 jxl 

10 transcription reaction, 5-10 ug of spleen or 

hybridoma mRNA in water was first annealed with 300 
ng (50.0 pmol) of the 3 1 V L primer (primer 14, Table 
1), at 65C for five minutes. Subsequently, the 
mixture was adjusted to 1.5 mM dATP, dCTP, dGTP, 

15 and dTTP, 40 mM Tris-HCl at pH 8.0, 8 mM MgCl 2 , 50 
mM NaCl, and 2 mM spermidine. Moloney-Murine 
Leukemia virus reverse transcriptase (Stratagene 
Cloning Systems) , 26 units, was added and the 
solution was maintained for 1 hour at 37C. The PCR 

20 amplification was performed in a 100 fil reaction 
containing approximately 5 ug of the cDNA/RNA 
hybrid produced as described above, 300 ng of the 
3' V L primer (primer 14 of Table 1), 300 ng of the 
5' V L primer (primer 15 of Table 1), 200 mM of a 

25 mixture of dNTP's, 50 mM KC1, 10 mM Tris-HCl pH 
8.3, 15 mM MgCl 2 , 0.1% gelatin and 2 units of Taq 
DNA polymerase. The reaction mixture was overlaid 
with mineral oil and subjected to 40 cycles of 
amplification. Each amplification cycle involved 

30 denaturation at 92C for 1 minute, annealing at 52C 
for 2 minutes and elongation at 72C for 1.5 
minutes. The amplified samples were extracted 
twice with phenol/chloroform, once with chloroform, 
ethanol precipitated and were stored at -70C in 10 

35 mM Tris-HCl at 7.5 and 1 mM EDTA. 
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6. Inserting DNA Homoloqs Into Vectors 
In preparation for cloning a library- 
enriched in V H sequences, PCR amplified products 
(2.5 mg/30 pi of 150 mM NaCl, 8 mM Tris-HCl (pH 
5 7,5), 6 mM MgS0 4 , 1 mM DTT, 200 mg/ml bovine serum 
albumin (BSA) at 37C were digested with restriction 
enzymes Xho I (125 units) and EcoR I (10 U) and 
purified on a 1% agarose gel. In cloning 
experiments which required a mixture of the 

10 products of the amplification reactions, equal 
volumes (50 pi, 1-10 ug concentration) of each 
reaction mixture were combined after amplification 
but before restriction digestion. After gel 
electrophoresis of the digested PCR amplified 

15 spleen mRNA, the region of the gel containing DNA 
fragments of approximately 350 bps was excised, 
electro-eluted into a dialysis membrane, ethanol 
precipitated and re-suspended in 10 mM Tris-HCl pH 
7.5 and 1 mM EDTA to a final concentration of 10 

20 ng//xl. Equimolar amounts of the insert were then 
ligated overnight at 5C to 1 ug of Lambda ZAP™ II 
vector (Stratagene Cloning Systems, La Jolla, CA) 
previously cut by EcoR I and Xho I. A portion of 
the ligation mixture (1 /il) was packaged for 2 

25 hours at room temperature using Gigapack Gold 

packaging extract (Stratagene Cloning Systems, La 
Jolla, CA) , and the packaged material was plated on 
XLl-blue host cells. The library was determined to 
consist of 2 x 10 7 V H homologs with less than 30% 

30 non-recombinant background. 

The vector used above, Lambda Zap II is a 
derivative of the original Lambda Zap (ATCC # 
40,298) that maintains all of the characteristics 
of the original Lambda Zap including 6 unique 

35 cloning sites, fusion protein expression, and the 
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ability to rapidly excise the insert in the form of 
a phagemid (Bluescript SK-) , but lacks the SAM 100 
mutation, allowing growth on many Non-Sup F 
strains, including XLl-Blue. The Lambda Zap II was 
5 constructed as described in Short et al . , Nucleic 
Acids Res, . 16:7583-7600, 1988, by replacing the 
Lambda S gene contained in a 4254 base pair (bp) 
DNA fragment produced by digesting Lambda Zap with 
the restriction enzyme Ncol. This 4254 bp DNA 

10 fragment was replaced with the 4254 bp DNA fragment 
containing the Lambda S gene isolated from Lambda 
gtlO (ATCC # 40,179) after digesting the vector 
with the restriction enzyme Ncol. The 4254 bp DNA 
fragment isolated from lambda gtlO was ligated into 

15 the original Lambda Zap vector using T4 DNA ligase 
and standard protocols for such procedures 
described in Current Protocols in Molecular 
Biology . Ausubel et al., eds., John Wiley and Sons, 
NY, 1987. 

20 In preparation of cloning a library 

enriched in V L sequences, 2 ug of PCR amplified 
products (2.5 mg/30 nl of 150 mM NaCl, 8 mM Tris- 
HC1 (pH 7.5), 6 mM Mg S0 4 , 1 mM DTT, 200 mg/ml BSA. 
37C) were digested with restriction enzymes Nco I 

25 (30 units) and Spe I (45 units) . The digested PCR 

amplified products were purified on a 1% agarose 
gel using standard electro-elution technique 
described in Molecular Cloning A Laboratory Manual. 
Maniatis et al., eds., Cold Spring Harbor, NY, 

30 (1982) . Briefly, after gel electro-elution of the 

digested PCR amplified product the region of the 
gel containing the V L -coding DNA fragment of the 
appropriate size was excised, electro-elution into 
a dialysis membrane, ethanol precipitated and re- 

35 suspended at a final concentration of 10 ng per ml 
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in a solution containing 10 mM Tris-HCl at pH 7.5 
and 1 mM EDTA. 

An equal molar amount of DNA representing 
a plurality of different V L -coding DNA homologs was 
5 ligated to a pBluescript SK- phagemid vector that 
had been previously cut with Nco I and Spe I. A 
portion of the ligation mixture was transformed 
using the manufacturer's instructions into Epicuian 
Coli XLl-Blue competent cells (Stratagene Cloning 

10 Systems , La Jolla, CA) . The transformant library 
was determined to consist of 1.2 x 10 3 colony 
forming units/ug of V L homologs with less than 3% 
non-recombinant background. 

7. Sequencing of Plasmids from the V H - 

15 Coding cDNA Library 

To analyze the Lambda Zap II phage clones 
the clones were excised from Lambda Zap into 
plasmids according to the manufacture's 
instructions (Stratagene Cloning System, La Jolla, 

20 CA) . Briefly, phage plaques were cored from the 
agar plates and transferred to sterile microfuge 
tubes containing 500 nl a buffer containing 50 mM 
Tris-HCl at pH 7.5, 100 mM NaCl, 10 mM MgS0 4 , and 
0.01% gelatin and 20 txl of chloroform. 

25 For excisions, 200 pi of the phage stock, 

200 fxl of XLl-Blue cells (A^q = 1.00) and 1 Ml of 
R408 helper phage (1 x 10 11 pfu/ml) were incubated 
at 37C for 15 minutes. The excised plasmids were 
infected into XLl-Blue cells and plated onto LB 

30 plates containing ampicillin. Double stranded DNA 
was prepared from the phagemid containing cells 
according to the methods described by Holmes et 
al. f Anal. Biochem. . 114;193, (1981). Clones were 
first screened for DNA inserts by restriction 

35 digests with either Pvu II or Bgl I and clones 
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containing the putative V H insert were sequenced 
using reverse transcriptase according to the 
general method described by Sanger et al . , Proc. 
Natl. Acad. Sci.. USA . 74:5463-5467, (1977) and the 
5 specific modifications of this method provided in 
the manufacturer's instructions in the AMV reverse 
transcriptase 35 S-dATP sequencing kit from 
Stratagene Cloning Systems, La Jolla, CA. 

8. Characterization Of The Cloned V H 

10 Repertoire 

The amplified products which had been 
digested with Xho I and EcoR I and cloned into 
Lambda ZAP, resulted in a cDNA library with 9.0 x 
10 5 pfu f s. In order to confirm that the library 

15 consisted of a diverse population of V H -coding DNA 
homologs, the N-terminal 120 bases of 18 clones, 
selected at random from the library, were excised 
and sequenced (Figure 5) . To determine if the 
clones were of V H gene origin, the cloned sequences 

20 were compared with known V H sequences and V L 

sequences. The clones exhibited from 80 to 90% 
homology with sequences of known heavy chain origin 
and little homology with sequences of light chain 
origin when compared with the sequences available 

25 in Sequences of Prote ins of Immunological Interest 
by Kabot et al., 4th ed., U.S. Dept. of Health and 
Human Sciences, (1987). This demonstrated that the 
library was enriched for the desired V H sequence in 
preference to other sequences, such as light chain 

30 sequences. 

The diversity of the population was 
assessed by classifying the sequenced clones into 
predefined subgroups (Figure 5) . Mouse V H sequences 
are classified into eleven subgroups (Figure 5) . 
35 Mouse V H sequences are classified into eleven 
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subgroups [I (A,B,), II (A,B,C), III (A,B,C,D,) V 
(A, B) ] based on framework amino acid sequences 
described in Sequences of Proteins of Immunological 
Interest by Kabot et al., 4th ed. , U.S. Dept. of 
Health and Human Sciences, (1987) ; Dildrop, 
T-mmvmnl oqv Today . 5:84, (1984); and Brodeur et al., 
Eur. J, Immunol. . 14; 922, (1984). Classification 
of the sequenced clones demonstrated that the cDNA 
library contained V H sequences of at least 7 
different subgroups. Further, a pairwise 
comparison of the homology between the sequenced 
clones showed that no two sequences were identical 
at all positions, suggesting that the population is 
diverse to the extent that it is possible to 
characterize by sequence analysis. 

Six of the clones (L 36-50, Figure 5) 
belong to the subclass III B and had very similar 
nucleotide sequences. This may reflect a 
preponderance of mRNA derived from one or several 
related variable genes in stimulated spleen, but 
the data does not permit ruling out the possibility 
of a bias in the amplif ication process. 

9. V H -Expression Vector Construction 
The main criterion used in choosing a 
vector system was the necessity of generating the 
largest number of Fab fragments which could be 
screened directly. Bacteriophage lambda was 
selected as the expression vector for three 
reasons. First, in vitro packaging of phage DNA is 
the most efficient method of reintroducing DNA into 
host cells. Second, it is possible to detect 
protein expression at the level of single phage 
plaques. Finally, the screening of phage libraries 
typically involve less difficulty with nonspecific 
binding. The alternative, plasmid cloning vectors, 
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are only advantageous in the analysis of clones 
after they have been identified. This advantage is 
not lost in the present system because of the use 
of lambda zap, thereby permitting a plasmid 
5 containing the heavy chain, light chain, or Fab 
expressing inserts to be excised. 

To express the plurality of V H -coding DNA 
homologs in an E. coli host cell, a vector was 
constructed that placed the V H -coding DNA homologs 

10 in the proper reading frame, provided a ribosome 

binding site as described by Shine et al., Nature, 
254:34, 1975, provided a leader sequence directing 
the expressed protein to the periplasmic space, 
provided a polynucleotide sequence that coded for a 

15 known epitope (epitope tag) and also provided a 
polynucleotide that coded for a spacer protein 
between the V H -coding DNA homolog and the 
polynucleotide coding for the epitope tag. A 
synthetic DNA sequence containing all of the above 

20 polynucleotides and features was constructed by 

designing single stranded polynucleotide segments 
of 20-40 bases that would hybridize to each other 
and form the double stranded synthetic DNA sequence 
shown in Figure 6. The individual single-stranded 

25 polynucleotides (N^Nu) are shown in Table 3. 

Polynucleotides 2, 3, 9-4', 11, 10-5', 6, 
7 and 8 were kinased by adding 1 Ml of each 
polynucleotide (0.1 ug/fil) and 20 units of T 4 
polynucleotide kinase to a solution containing 70 

30 mM Tris-HCl at pH 7.6, 10 mM MgCl 2 , 5 mM DTT, 10 mM 
2ME, 500 micrograms per ml of BSA. The solution 
was maintained at 37C for 30 minutes and the 
reaction stopped by maintaining the solution at 65C 
for 10 minutes • The two end polynucleotides 20 ng 

35 of polynucleotides Nl and polynucleotides N12, were 
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added to the above kinasing reaction solution 
together with 1/10 volume of a solution containing 
20.0 mM Tris-HCl at pH 7.4, 2.0 mM MgCl 2 and 50.0 mM 
NaCl. This solution was heated to 70C for 5 
minutes and allowed to cool to room temperature, 
approximately 25C, over 1.5 hours in a 500 ml 
beaker of water. During this time period all 10 
polynucleotides annealed to form the double 
stranded synthetic DNA insert shown in Figure 6A. 
The individual polynucleotides were covalently 
linked to each other to stabilize the synthetic DNA 
insert by adding 40 (il of the above reaction to a 
solution containing 50 mM Tris-HCl at pH 7.5, 7 mM 
MgCl 2 , 1 mM DTT, 1 mM adenosine triphosphate (ATP) 
and 10 units of T4 DNA ligase. This solution was 
maintained at 37C for 30 minutes and then the T4 
DNA ligase was inactivated by maintaining the 
solution at 65C for 10 minutes. The end 
polynucleotides were kinased by mixing 52 /il of the 
above reaction, 4 (il of a solution containing 10 mM 
ATP and 5 units of T4 polynucleotide kinase. This 
solution was maintained at 37C for 30 minutes and 
then the T4 polynucleotide kinase was inactivated 
by maintaining the solution at 65C for 10 minutes. 
The completed synthetic DNA insert was ligated 
directly into a lambda Zap II vector that had been 
previously digested with the restriction enzymes 
Not I and Xho I. The ligation mixture was packaged 
according to the manufactured instructions using 
Gigapack II Gold packing extract available from 
Stratagene Cloning Systems, La Jolla, CA. The 
packaged ligation mixture was plated on XL1 blue 
cells (Stratagene Cloning Systems, San Diego, CA) . 
Individual Lambda Zap II plaques were cored and the 
inserts excised according to the in vivo excision 
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protocol provided by the manufacturer, stratagene 
Cloning Systems, La Jolla, CA. This in vivo 
excision protocol moves the cloned insert from the 
Lambda Zap II vector into a plasmid vector to allow 
5 easy manipulation and sequencing. The accuracy of 
the above cloning steps was confirmed by sequencing 
the insert using the Sanger dideoxide method 
described in by Sanger et al • , Proc. Natl, Acad, 
Sci USA . 74:5463-5467, (1977) and using the 
10 manufactured instructions in the AMV Reverse 

Transcriptase 35 S-ATP sequencing kit from Stratagene 
Cloning Systems, La Jolla, CA. The sequence of the 
resulting V H expression vector is shown in Figure 6A 
and Figure 7. 







Table 3 




Nl) 


5' 


GGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3 • 




N2) 


5' 


AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3 


1 


N3) 


5' 


GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3 • 




N4) 


5' 


AGGTGAAACTGCTCGAGAATTCTAGACTAGGTTAATAG 


3' 


N5) 


5' 


TCGACTATTAACTAGTCTAGAATTCTCGAG 3 • 




N6) 


5' 


CAGTTTCACCTGGGCCATGGCTGGTTGGG 3 1 




N7) 


5' 


CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3 1 


N8) 


5' 


GTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGC 


3« 


N9-4) 


5' 


AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 3 • 


Nil) 


5' 


GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3 • 




N12) 


5» 


TCGACGAATTCTATTAAGAACCGTAGTC 3 • 




N10-5) 


5' 


CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 


3' 






10. V. Exoression Vector Construction 





30 To express the plurality of V L coding 

polynucleotides in an E. coli host cell, a vector was 
constructed that placed the V L coding polynucleotide in 
the proper reading frame, provided a ribosome binding 
site as described by Shine et al., Nature , 254:34, 

35 (1975), provided a leader sequence directing the 
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expressed protein to the periplasmic space and also 
provided a polynucleotide that coded for a spacer protein 
between the V L polynucleotide and the polynucleotide 
coding for the epitope tag. A synthetic DNA sequence 
5 containing all of the above polynucleotides and features 
was constructed by designing single stranded 
polynucleotide segments of 20-40 bases that would 
hybridize to each other and form the double stranded 
synthetic DNA sequence shown in Figure 6B. The 

10 individual single-stranded polynucleotides (N^Nq) are 
shown in Table 3. 

Polynucleotides N2, N3, N4, N6, N7 and N8 were 
kinased by adding 1 jil of each polynucleotide and 20 
units of T* polynucleotide kinase to a solution containing 

15 70 mM Tris-HCl at pH 7.6, 10 mM MgCl 2 , 5 mM DDT, 10 mM 
2 ME, 500 micrograms per ml of BSA. The solution was 
maintained at 37C for 30 minutes and the reaction stopped 
by maintaining the solution at 65C for 10 minutes. The 
two end polynucleotides 20 ng of polynucleotides Nl. and 

20 polynucleotides N5 were added to the above kinasing 

reaction solution together with 1/10 volume of a solution 
containing 20.0 mM Tris-HCl at pH 7.4, 2.0 mM MgCl 2 and 
50.0 mM NaCl. This solution was heated to 70 C for 5 
minutes and allowed to cool to room temperature, 

25 approximately 25C, over 1.5 hours in a 500 ml beaker of 
water. During this time period all the polynucleotides 
annealed to form the double stranded synthetic DNA 
insert. The individual polynucleotides were covalently 
linked to each other to stabilize the synthetic DNA 

30 insert with adding 40 nl of the above reaction to a 

solution containing 50 til Tris-HCl at pH 7.5, 7 mM MgCl 2 , 
1 mM DTT, 1 mM ATP and 10 units of T4 DNA ligase. This 
solution was maintained at 37C for 30 minutes and then 
the T4 DNA ligase was inactivated by maintaining the 

35 solution at 65C for 10 minutes. The end polynucleotides 
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were kinased by mixing 52 fil of the above reaction, 4 fil 
of a solution recontaining 10 mM ATP and 5 units of T4 
polynucleotide kinase. This solution was maintained at 
37C for 30 minutes and then the T4 polynucleotide kinase 
5 was inactivated by maintaining the solution at 65C for 10 
minutes. The completed synthetic DNA insert was ligated 
directly into a Lambda Zap II vector that had been 
previously digested with the restriction enzymes Not I 
and Xho I. The ligation mixture was packaged according 

10 to the manufactured instructions using Gigapack II Gold 
packing extract available from Stratagene Cloning 
Systems, La Jolla, CA. The packaged ligation mixture was 
plated on XLl-Blue cells (Stratagene Cloning Systems, La 
Jolla, CA). Individual lambda Zap II plaques were cored 

15 and the inserts excised according to the in vivo excision 
protocol provided by the manufacturer, Stratagene Cloning 
Systems, La Jolla, CA and described in Short et al., 
Nucleic Acids Res. . 16:7583-7600, 1988. This in vivo 
excision protocol moves the cloned insert from the Lambda 

20 Zap II vector into a phagemid vector to allow easy 
manipulation and sequencing and also produces the 
phagemid version of the V L expression vectors. The 
accuracy of the above cloning steps was confirmed by 
sequencing the insert using the Sanger dideoxide method 

25 described by Sanger et al., Proc. Nat l. Acad, Aci. USA. 
74:5463-5467, (1977) and using the manufacturer's 
instructions in the AMV reverse transcriptase 35 S-dATP 
sequencing kit from Stratagene Cloning Systems, La Jolla, 
CA. The sequence of the resulting V L expression vector is 

30 shown in Figure 6 and Figure 8. 

The V|_ expression vector used to construct the 
V L library was the phagemid produced to allow the DNA of 
the V L expression vector to be determined. The phagemid 
was produced, as detailed above, by the in vivo excision 

35 process from the Lambda Zap V L expression vector (Figure 
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8) . The phagemid version of this vector was used because 
the Nco I restriction enzyme site is unique in this 
version and thus could be used to operatively linked the 
V L DNA homologs into the expression vector. 
5 11. V t II-Expression Vector Construction 

To express the plurality of V L -coding DNA 
homologs in an E. coli host cell, a vector was 
constructed that placed the V L -coding DNA homologs in the 
proper reading frame, provided a ribosome binding site as 

10 described by Shine et al., Nature , 254:34, 1975, provided 

the Pel B gene leader sequence that has been previously 
used to successfully secrete Fab fragments in E. coli by 
Lei et al., J. Bac. . 169:4379 (1987) and Better et al., 
Science, 240:1041 (1988), and also provided a 

15 polynucleotide containing a restriction endonuclease site 
for cloning. A synthetic DNA sequence containing all of 
the above polynucleotides and features was constructed by 
designing single stranded polynucleotide segments of 20- 
60 bases that would hybridize to each other and form the 

20 double stranded synthetic DNA sequence shown in Figure 
10. The sequence of each individual single-stranded 
polynucleotides (01-08) within the double stranded 
synthetic DNA sequence is shown in Table 4. 

Polynucleotides 02, 03, 04, 05, 06 and 07 were 

25 kinased by adding 1 pi (0.1 ug/^1) of each polynucleotide 
and 20 units of T 4 polynucleotide kinase to a solution 
containing 70 mM Tris-HCl at pH 7.6, 10 mM magnesium 
chloride (MgCl) , 5 mM dithiothreitol (DTT) , 10 mM 2- 
mercaptoethanol (2ME) , 500 micrograms per ml of bovine 

30 serum albumin. The solution was maintained at 37C for 30 
minutes and the reaction stopped by maintaining the 
solution at 65C for 10 minutes. The 20 ng each of the 
two end polynucleotides, 01 and 08, were added to the 
above kinasing reaction solution together with 1/10 

35 volume of a solution containing 20.0 mM Tris-HCl at pH 
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7.4, 2.0 mM MgCl and 15.0 mM sodium chloride (NaCl) . 
This solution was heated to 70C for 5 minutes and allowed 
to cool to room temperature, approximately 25C, over 1.5 
hours in a 500 ml beaker of water. During this time 
5 period all 8 polynucleotides annealed to form the double 
stranded synthetic DNA insert shown in Figure 9. The 
individual polynucleotides were covalently linked to each 
other to stabilize the synthetic DNA insert by adding 40 
jxl of the above reaction to a solution containing 50 ml 

10 Tris-HCl at pH 7.5, 7 ml MgCl, 1 mm DTT, 1 mm ATP and 10 

units of T4 DNA ligase. This solution was maintained at 
37C for 30 minutes and then the T4 DNA ligase was 
inactivated by maintaining the solution at 65C for 10 
minutes. The end polynucleotides were kinased by mixing 

15 52 /xl of the above reaction, 4 pi of a solution 

containing 10 mM ATP and 5 units of T4 polynucleotide 
kinase. This solution was maintained at 37C for 30 
minutes and then the T4 polynucleotide kinase was 
inactivated by maintaining the solution at 65C for 10 

20 minutes. The completed synthetic DNA insert was ligated 
directly into a lambda Zap II vector that had been 
previously digested with the restriction enzymes Not I 
and Xho I. The ligation mixture was packaged according 
to the manufacture's instructions using Gigapack II Gold 

25 packing extract available from Stratagene Cloning 

Systems, La Jolla, CA. The packaged ligation mixture was 
plated on XL1 blue cells (Stratagene Cloning Systems, La 
Jolla, CA) • Individual lambda Zap II plaques were cored 
and the inserts excised according to the in vivo excision 

30 protocol provided by the manufacturer, Stratagene Cloning 
Systems, La Jolla , CA. This in vivo excision protocol 
moves the cloned insert from the lambda Zap II vector 
into a plasmid vector to allow easy manipulation and 
sequencing. The accuracy of the above cloning steps was 

35 confirmed by sequencing the insert using the 
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manufacture's instructions in the AMV Reverse 
Transcriptase 35 S-dATP sequencing kit from Stratagene 
Cloning Systems, La Jolla, CA. The sequence of the 
resulting V L II-expression vector is shown in Figure 9 and 
5 Figure 11. 



TABLE 4 

01) 5 f 3 4TGAATTCTAAACTAGTCGCCAAGGAGACAGTCAT 3» 

02) 5« AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3' 
10 03) 5 1 GTTATTACTCGCTGCCCAACCAGCCATGGCC 3 1 

04) 5 1 GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 3« 

05) 5" GTATTTCATTATGACTGTCTCCTTGGCGACTAGTTTAGAA- 

TTCAAGCT 3' 

06) 5 1 CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3' 
15 07) 5 ■ TGACGAGCTCGGCCATGGCTGGTTGGG 3 f 

08) 5' TCGACGGCCGCTTAACTCTAGAAC 3' 



12. V H + V r Library Construction 

To prepare an expression library enriched in V H 

20 sequences, DNA homologs enriched in V H sequences were 

prepared according to Example 6 using the same set of 5 f 
primers but with primer 12 A (Table 1) as the 3» primer. 
These homologs were then digested with the restriction 
enzymes Xho I and Spe I and purified on a 1% agarose gel 

25 using the standard electro-elution technique described in 
Molecular Cloning A Laboratory Manual , Maniatis et al., 
eds. f Cold Spring Harbor, NY, (1982). These prepared V H 
DNA homologs were then directly inserted into the V H 
expression vector that had been previously digested with 

30 Xho I and Spe I. 

The ligation mixture containing the V H DNA 
homologs were packaged according to the manufacturers 
specifications using Gigapack Gold II Packing Extract 
(Stratagene Cloning Systems, La Jolla, CA) . The 

35 expression libraries were then ready to be plated on XL-1 
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Blue cells. 

To prepare a library enriched in V L sequences, 
PCR amplified products enriched in V L sequences were 
prepared according to Example 6. These V L DNA homologs 
5 were digested with restriction enzymes Nco I and Spe I. 

The digested V L DNA homologs were purified on a 1% agarose 
gel using standard electro-elusion techniques described 
in Molecular Cloning A Laboratory Manual , Maniatis et 
al., eds., Cold Spring Harbor, NY (1982), The prepared V L 

10 DNA homologs were directly inserted into the V L expression 
vector that had been previously digested with the 
restriction enzymes Nco I and Spe I. The ligation 
mixture containing the V L DNA homologs were transformed 
into XL-1 blue competent cells using the manufacturer's 

15 instructions (Stratagene Cloning Systems, La Jolla, CA) . 

13. Inserting V, Coding DNA Homologs 

Into V 1 Expression Vector 
In preparation for cloning a library enriched 
in V L sequences, PCR amplified products (2-5 ug/30 /xl of 

20 150 mM NaCl, 8 mM Tris-HCl (pH 7.5), 6 mM MgS0 4 , 1 mM DTT, 

200 ug/ml BSA at 37C were digested with restriction 
enzymes Sac I (125 units) and Xba I (125 units) and 
purified on a 1% agarose gel. In cloning experiments 
which required a mixture of the products of the 

25 amplification reactions, equal volumes (50 pi, 1-10 ug 
concentration) of each reaction mixture were combined 
after amplification but before restriction digestion. 
After gel electrophoresis of the digested PCR amplified 
spleen mRNA, the region of the gel containing DNA 

30 fragments of approximate 350 bps was excised, electro- 

eluted into a dialysis membrane, ethanol precipitated and 
re-suspended in a TE solution containing 10 mM Tris-HCl 
pH 7.5 and 1 mM EDTA to a final concentration of 50 
ng//il • 

35 The V L II-expression DNA vector was prepared for 
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cloning by admixing 100 ug of this DNA to a solution 
containing 250 units each of the restriction 
endonucleases Sac 1 and Xba 1 (both from Boehringer 
Mannheim, Indianapolis, IN) and a buffer recommended by 
5 the manufacturer. This solution was maintained at 37 
from 1.5 hours. The solution was heated at 65C for 15 
minutes top inactivate the restriction endonucleases. 
The solution was chilled to 30C and 25 units of heat- 
killable (HK) phosphatase (Epicenter, Madison, WI) and 

10 CaCl2 were admixed to it according to the manufacturer's 
specifications. This solution was maintained at 30C for 
1 hour. The DNA was purified by extracting the solution 
with a mixture of phenol and chloroform followed by 
ethanol precipitation. The V L II expression vector was now 

15 ready for ligation to the V L DNA homologs prepared in the 
above examples. 

DNA homologs enriched in V L sequences were 
prepared according to Example 5 but using a 5 1 light 
chain primer and the 3' light chain primer shown in Table 

20 2. Individual amplification reactions were carried out 
using each 5 1 light chain primer in combination with the 
3 1 light chain primer. These separate V L homolog 
containing reaction mixtures were mixed and digested with 
the restriction endonucleases Sac 1 and Xba 1 according 

25 to Example 6. The V L homologs were purified on a 1% 

agarose gel using the standard electro-elution technique 
described in Molecular Cloning A Laboratory Manual. 
Maniatis et al., eds., Cold Spring Harbor, NY, (1982). 
These prepared V L DNA homologs were then directly inserted 

30 into the Sac 1 - Xba cleaved V L II-expression vector that 
was prepared above by ligating 3 moles of V L DNA homolog 
inserts with each mole of the V L II-expression vector 
overnight at 5C. 3.0 x 10 5 plague forming units were 
obtained after packaging the DNA with Gigapack II Bold 

35 (Stratagene Cloning Systems, La Jolla, CA) and 50% were 



WO 90/14430 



PCT/US90/02836 



85 

recombinants . 

14, Randomly Combining V H and V t DNA 
Homoloas on the Same Expression 
Vector 

5 The V L II-expression library prepared in Example 

13 was amplified and 500 ug of V L II-expression library 
phage DNA prepared from the amplified phage stock using 
the procedures described in Molecular Cloning: A 
Laboratory Manual , Maniatis et al., eds. , Cold Spring 

10 Harbor, NY (1982), 50 ug of this V L II-expression library 
phage DNA was maintained in a solution containing 100 
units of MLuI restriction endonuclease (Boehringer 
Mannheim, Indianapolis, IN) in 200 /xl of a buffer 
supplied by the endonuclease manufacturer for 1.5 hours 

15 at 37C. The solution was then extracted with a mixture 
of phenol and chloroform. The DNA was then ethanol 
precipitated and re-suspended in 100 Ml of water. This 
solution was admixed with 100 units of the restriction 
endonuclease EcoR I (Boehringer Mannheim, Indianapolis, 

20 IN) in a final volume of 200 fxl of buffer containing the 
components specified by the manufacturer. This solution 
was maintained at 37C for 1.5 hours and the solution was 
then extracted with a mixture of phenol and chloroform. 
The DNA was ethanol precipitated and the DNA re-suspended 

25 in TE. 

The V H expression library prepared in Example 12 
was amplified and 500 ug of V H expression library phage 
DNA prepared using the methods detailed above. 50 ug of 
the V H expression library phage DNA was maintained in a 

30 solution containing 100 units of Hind III restriction 

endonuclease (Boehringer Mannheim, Indianapolis, IN) in 
200 jul of a buffer supplied by the endonuclease 
manufacturer for 1.5 hours at 37C. The solution was then 
extracted with a mixture of phenol and chloroform 

35 saturated with 0.1 M Tris-HCl at pH 7.5. The DNA was 
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then ethanol precipitated and re-suspended in 100 /xl of 
water. This solution was admixed with 100 units of the 
restriction endonuclease EcoR I (Boehringer Mannheim , 
Indianapolis, IN) in a final volume of 200 nl of buffer 
5 containing the components specified by the manufacturer. 

This solution was maintained at 37C for 1.5 hours and the 
solution was then extracted with a mixture of phenol and 
chloroform. The DNA was ethanol precipitated and the DNA 
re-suspended in TE. 

10 The restriction digested V H and V L II-expression 

Libraries were ligated together. The ligation reaction 
consisted of 1 ug of V H and 1 ug of V L II phage library DNA 
in a 10 Ml reaction using the reagents supplied in a 
ligation kit purchased from Stratagene Cloning Systems 

15 (La Jolla, California) . After ligation for 16 hr at 4C, 

1 nl of the ligated the phage DNA was packaged with 
Gigapack Gold II packaging extract and plated on XL 1- 
blue cells prepared according to the manufacturers 
instructions. A portion of the 3X10 6 clones obtained were 

20 used to determine the effectiveness of the combination. 
The resulting V H and V L expression vector is shown in 
Figure 11. 

Clones containing both V H and V L were excised 
from the phage to pBluescript using the in vitro excision 

25 protocol described by Short et al., Nucleic Acid 
fiesearch . 16:7583-7600 (1988). Clones chosen for 
excision expressed the decapeptide tag and did not cleave 
X-gal in the presence of 2mM IPTGthus remaining white. 
Clones with these characteristics represented 30% of the 

30 library. 50% of the clones chosen for excision contained 
a v H and V L as determined by restriction analysis. Since 
approximately 30% of the clones in the V H library 
expressed the decapeptide tag and 50% of the clones in 
the V L II library contained a V L sequence it was 

35 anticipated that no more than 15% of the clones in the 
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combined library would contain both V H and V L clones. The 
actual number obtained was 15% of the library indicating 
that the process of combination was very efficient. 

15. segregating dna Homoloas For a V H 
5 Antigen Binding Protein 

To segregate the individual clones containing 
DNA homologs that code for a V H antigen binding protein, 
the titre of the V H expression library prepared according 
to Example 11 was determined. This library titration was 

10 performed using methods well known to one skilled in the 
art. Briefly, serial dilutions of the library were made 
into a buffer containing 100 mM NaCl, 50 mM Tris-HCl at 
pH 7.5 and 10 mM MgS0 4 . Ten /il of each dilution was added 
to 200 p.1 of exponentially growing e. coli cells and 

15 maintained at 37C for 15 minutes to allow the phage to 
absorb to the bacterial cells. Three ml of top agar 
consisting of 5 g/L NaCl, 2 g/L of MgS0 4 , 5 g/L yeast 
extract, 10 g/L NZ amine (casein hydrolysate) and 0.7% 
melted, 50C agarose. The phage, the bacteria and the top 

20 agar were mixed and then evenly distributed across the 
surface of a prewarmed bacterial agar plate (5 g/L NaCl, 
2 g/L MgS0 4 , 5 g/L yeast extract , 10 g/L NZ amine (casein 
hydrolysate) and 15 g/L Difco agar. The plates were 
maintained at 37C for 12 to 24 hours during which time 

25 period the lambda plaques developed on the bacterial 

lawn. The lambda plaques were counted to determined the 
total number of plaque forming units per ml in the 
original library. 

The titred expression library was then plated 

30 out so that replica filters could be made from the 
library. The replica filters will be used to later 
segregate out the individual clones in the library that 
are expressing the antigens binding proteins of interest. 
Briefly, a volume of the titred library that would yield 

35 20,000 plaques per 150 millimeter plate was added to 600 
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til of exponentially growing E. coli cells and maintained 
at 37C for 15 minutes to allow the phage to absorb to the 
bacterial cells. Then 7.5 ml of top agar was admixed to 
the solution containing the bacterial cells and the 
5 absorbed phage and the entire mixture distributed evenly 
across the surface of a prewarmed bacterial agar plate. 
This process was repeated for a sufficient number of 
plates to plate out a total number of plaques at least 
equal to the library size. These plates were then 

10 maintained at 37 C for 5 hours. The plates were then 
overlaid with nitrocellulose filters that had been 
pretreated with a solution containing 10 mM isopropyl- 
beta-D-thiogalactopyranosid (IPTG) and maintained at 37C 
for 4 hours. The orientation of the nitrocellulose 

15 filters in relation to the plate were marked by punching 
a hole with a needle dipped in waterproof ink through the 
filter and into the bacterial plates at several 
locations. The nitrocellulose filters were removed with 
forceps and washed once in a TBST solution containing 20 

20 mM Tris-HCL at pH 7.5, 150 mM NaCl and 0.05% monolaurate 
(tween-20) . A second nitrocellulose filter that had also 
been soaked in a solution containing 10 mM IPTG was 
reapplied to the bacterial plates to produce duplicate 
filters. The filters were further washed in a fresh 

25 solution of TBST for 15 minutes. Filters were then 

placed in a blocking solution consisting of 20 mM Tris- 
HC1 at pH 7.5, 150 mM NaCL and 1% BSA and agitated for 1 
hour at room temperature. The nitrocellulose filters 
were transferred to a fresh blocking solution containing 

30 a 1 to 500 dilution of the primary antibody and gently 

agitated for at least 1 hour at room temperature. After 
the filters were agitated in the solution containing the 
primary antibody the filters were washed 3 to 5 times in 
TBST for 5 minutes each time to remove any of the 

35 residual unbound primary antibody. The filters were 



WO 90/14430 



PCI7US90/02836 



89 

transferred into a solution containing fresh blocking 
solution and a 1 to 500 to a 1 to 1,000 dilution of 
alkaline phosphatase conjugated secondary antibody. The 
filters were gently agitated in the solution for at least 
5 1 hour at room temperature. The filters were washed 3 to 
5 times in a solution of TBST for at least 5 minutes each 
time to remove any residual unbound secondary antibody. 
The filters were washed once in a solution containing 20 
mM Tris-HCl at pH 7.5 and 150 mM NaCL. The filters were 

10 removed from this solution and the excess moisture 
blotted from them with filter paper. The color was 
developed by placing the filter in a solution containing 
100 mM Tris-HCl at pH 9.5, 100 mM NaCl, 5 mM MgCl 2 , 0.3 
mg/ml of nitro Blue Tetrazolium (NBT) and 0.15 mg/ml of 

15 5-bromo-4-chloro-3-indolyl-phosphate (BCIP) for at least 
30 minutes at room temperature. The residual color 
development solution was rinsed from the filter with a 
solution containing 20 mM Tris-HCl at pH 7.5 and 150 mM 
NaCl. The filter was then placed in a stop solution 

20 consisting of 20 mM Tris-HCl at pH 2.9 and 1 mM EDTA. 

The development of an intense purple color indicates at 
positive result. The filters are used to locate the 
phage plaque that produced the desired protein. That 
phage plaque is segregated and then grown up for further 

25 analysis. 

Several different combinations of primary 
antibodies and second antibodies were used. The first 
combination used a primary antibody immunospecif ic for a 
decapeptide that will be expressed only if the V H antigen 

30 binding protein is expressed in the proper reading frame 
to allow read through translation to include the 
decapeptide epitope covalently attached to the V H antigen 
binding protein. This decapeptide epitope and an 
antibody immunospecif ic for this decapeptide epitope was 

35 described by Green et al., Cell 28:477 (1982) and Niemann 
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et al., Proc. Nat, Acad, Sci. U.S.A. 80:4949 (1983). The 
sequence of the decapeptide recognized is shown in Figure 
11. A functional equivalent of the monoclonal antibody 
that is immunospecific for the decapeptide can be 
5 prepared according to the methods of Green et al. and 
Niemann et al. The secondary antibody used with this 
primary antibody was a goat anti-mouse IgG (Fisher 
Scientific) . This antibody was immunospecific for the 
constant region of mouse IgG and did not recognize any 

10 portion of the variable region of heavy chain. This 
particular combination of primary and secondary 
antibodies when used according to the above protocol 
determined that between 25% and 30% of the clones were 
expressing the decapeptide and therefore these clones 

15 were assumed to also be expressing a V H antigen binding 
protein. 

In another combination the anti-decapeptide 
mouse monoclonal was used as the primary antibody and an 
affinity purified goat anti-mouse Ig r commercially 

20 available as part of the picoBlue immunoscreening kit 

from Stratagene Cloning System, La Jolla, CA, was use as 
the secondary antibody. This combination resulted in a 
large number of false positive clones because the 
secondary antibody also immunoreacted with the V H of the 

25 heavy chain Therefore this antibody reacted with all 
clones expressing any V H protein and this combination of 
primary and secondary antibodies did not specifically 
detect clones with the V H polynucleotide in the proper 
reading frame and thus allowing expressing of the 

30 decapeptide. 

Several combinations of primary and secondary 
antibodies are used where the primary antibody is 
conjugated to fluorescein isothiocyanate (FITC) and thus 
the immunospecificity of the antibody was not important 

35 because the antibody is conjugated to the preselected 
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antigen (FITC) and it is that antigen that should be 
bound by the V H antigen binding proteins produced by the 
clones in the expression library. After this primary 
antibody has bound by virtue that is FITC conjugated 
5 mouse monoclonal antibody p2 5764 (ATCC #HB-9505) . The 
secondary antibody used with this primary antibody is a 
goat anti-mouse Ig 6 (Fisher Scientific, Pittsburg, PA) 
conjugated to alkaline phosphatase. Using the method 
described in Antibodies A Laboratory Manual , Harlow and 

10 Lowe, eds., Cold Springing Harbor, NY, (1988). If a 

particular clone in the V H expression, library, expresses 
a V H binding protein that binds the FITC covalently 
coupled to the primary antibody, the secondary antibody 
binds specifically and when developed the alkaline 

15 phosphate causes a distinct purple color to form. 

The second combination of antibodies of the 
type uses a primary antibody that is FITC conjugated 
rabbit anti-human IgG (Fisher Scientific, Pittsburg, PA) . 
The secondary antibody used with this primary antibody is 

20 a goat anti-rabbit IgG conjugated to alkaline phosphatase 
using the methods described in Antibodies A Laboratory 
Manual , Harlow and Lane, eds., Cold Spring Harbor, NY, 
(1988) . If a particular clone in the V H expression 
library expresses a V H binding protein that binds the FITC 

25 conjugated to the primary antibody, the secondary 
antibody binds specifically and when developed the 
alkaline phosphatase causes a distinct purple color to 
form. 

Another primary antibody was the mouse 
30 monoclonal antibody p2 5764 (ATCC # HB-9505) conjugated 

to both FITC and 125 I. The antibody would be bound by any 
V H antigen binding proteins expressed. Then because the 
antibody is also labeled with 125 I, an autoradiogram of 
the filter is made instead of using a secondary antibody 
35 that is conjugated to alkaline phosphatase. This direct 
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production of an autoradiogram allows segregation of the 
clones in the library expressing a V H antigen binding 
protein of interest. 

16. Segregating DffA Homologs For a 
5 Yh frpd V L that Form an fintigen 

To segregate the individual clones containing 
DNA homologs that code for a V H and a V L that form an 
antigen binding F v the V H and V L expression library was 

10 titred according to Example 15. The titred expression 
library was then screened for the presence of the 
decapeptide tag expressed with the V H using the methods 
described in Example 15. DNA was then prepared from the 
clones to express the decapeptide tag. This DNA was 

15 digested with the restriction endonuclease Pvu II to 
determine whether these clones also contained a V L DNA 
homolog. The slower migration of a PvuII restriction 
endonuclease fragment indicated that the particular clone 
contained both a V H and a V L DNA homolog. 

20 The clones containing both a V H and a V L DNA 

homolog were analyzed to determine whether these clones 
produced an assembled F v protein molecule from the V H and 
V L DNA homologs. 

The F v protein fragment produced in clones 

25 containing both V H and V L was visualized by immune 

precipitation of radiolabeled protein expressed in the 
clones. A 50 ml culture of LB broth (5 g/L yeast 
extract, 10 g/L and tryptone 10 g/L NaCl at pH 7.0) 
containing 100 ug/^1 of ampicillin was inoculated with 

30 Coli harboring a plasmid contain a V H and a V L . The 
culture was maintained at 37C with shaking until the 
optical density measured at 550 nm was 0.5 culture then 
was centrifuged at 3,000 g for 10 minutes and re- 
suspended in 50 ml of M9 media (6 g/L Na 2 HP0 4 , 3 g/L 

35 KH 2 P0 4 , 0.5 g/L NaCl, 1 g/L NH 4 C1, 2g/L glucose, 2 mM 
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MgS0 4 and 0.1 mMgSO* CaCl 2 supplemented with amino acids 
without methionine or cysteine. This solution was 
maintained at 37C for 5 minutes and then 0.5 mCi of 35 S as 
HS0 4 " (New England Nuclear, Boston, MA) was added and the 
5 solution was further maintained at 37C for an additional 
2 hours. The solution was then centrifuged at 3000xg and 
the supernatant discarded. The resulting bacterial cell 
pellet was frozen and thawed and then re-suspended in a 
solution containing 40 mM Tris pH 8.0, 100 mM sucrose and 
10 1 mM EDTA. The solution was centrifuged at lOOOOxg for 
10 minutes and the resulting pellet discarded. The 
supernatant was admixed with 10 Ml of anti-decapeptide 
monoclonal antibody and maintained for 30-90 minutes on 
ice. 40 /il of protein G coupled to sepharose beads 
15 (Pharmacia, Piscataway, NJ) was admixed to the solution 

and the added solution maintained for 30 minutes on ice 
to allow an immune precipitate to form. The solution was 
centrifuged at 10,000 xg for 10 minutes and the resulting 
pellet was re-suspended in 1 ml of a solution containing 
100 mM Tris-HCl at pH 7.5 and centrifuged at 10,000 xg 
for 10 minutes. This procedure was repeated twice. The 
resulting immune precipitate pellet was loaded onto a 
PhastGel Homogenous 20 gel (Pharmacia, Piscataway, NJ) 
according to the manufacturer's directions. The gel was 
dried and used to expose X-ray film. 

The resulting autoradiogram is shown in Figure 
12. The presence of assembled F v molecules can be seen by 
the presence of V t that was immunoprecipitated because it 
was attached to the V H -decapeptide tag recognized by the 
precipitating antibody. 

17. Construction of Selectable V H and Vj 
Expression 

A. Construction of the Mutant S Gene 
Expression Plasmid 
The bacteria phage lambda S gene has been shown 



WO 90/14430 



PCT/US90/02836 



94 

to be directly involved in lysis as described by Reader 
et al., Virology . 43:607-622 (1971) • The S gene encodes 
a 107 amino acid polypeptide that is responsible for a 
lethal event in the cytoplasmic membrane that allows the 

5 release of the R gene product into the periplasm of the 
cell where it degrades the peptidoglycan as described by 
Garrett et al., J. Virology . 44:886-892 (1982). The 
dominant S gene mutant (S 10 o SAM 5) is a mutation that has 
been shown to interfere with the formation of the normal 

0 S protein membrane channel thus preventing cell lysis. 

See Raab et al., J. Mol, Biol. , 199:95-105 (1988). This 
mutant S gene is dominant because when it is expressed, 
even in the presence of the wild type S protein it 
prevents lysis of the bacterial cell. The S 100 SAM 5 

5 dominant mutation also contains an amber mutation and 

therefore requires the expression of a suppressor tRNA in 
the bacterial cell in order for mutant S protein to be 
produced. Further, this amber mutation allows the growth 
of bacteria containing the mutant S gene construct 

0 without lysis because without this amber suppressing tRNA 
no functional S gene protein is produced. 

The dominant S gene from Lambda Zap Sam 5 was 
isolated using the polymerase chain reaction. Briefly, 
Lambda Zap Sam 5 DNA was isolated using the methods 

5 described in Molecular Cloning: A Laboratory Manual. 
Maniatis et al., eds., Cold Spring Harbor, NY (1982). 
Lambda Zap Sam 5 DNA, 0.1 ug, was admixed with a buffer 
containing 150 ng of primer RG15 (Table 5) and 150 ng of 
primer RG16 (Table 5), 0.25 mM each of dTTP, dCTP, dGTP, 

D and dATP (dNTPs) , 50 mM KC1, 10 mM Tris-HCl at pH 8.3, 

1.5 mM MgCl 2 , and 0.15% sterile gelatin. The resulting 
solution was heated to 91C for five minutes and then 
placed in a 54C water bath for five minutes. 0.5 
microliters of Taq polymerase (Perkin Elmer-Cetus, 

5 Norwalk, CT) was added and the solution overlaid with a 
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layer of mineral oil. 

The solution was then placed in a DNA Thermal 
Cycler (PerKin-Elmer Cetus, Norwalk, CT) and subjected to 
the following temperature and time conditions: (1) 72C 
5 for two minutes to allow for primer extension, (2) 91C 
for one minute to heat denature the duplex DNA and (3) 
54C for two minutes to allow the single-stranded nucleic 
acids to hybridize. The same solution was subjected to 
further cycles of steps (1), (2), and (3) for a total of 
10 thirty cycles according to the manufacturer^ 

instructions. The cycled solution was then maintained at 
72C for ten minutes and then stored at 4C until used. 

The mutant S gene DNA produced by the above 
polymerase chain reaction was digested with the 
15 restriction endonucleases Hind III and Bgl II. Briefly, 
one half of the polymerase chain reaction product 
produced above was purified by phenol extraction followed 
by ethanol precipitation. The DNA was then admixed with 
a solution containing 100 mM NaCl, 10 mM Tris-HCl at pH 
7.7, 19 mM Mg Cl 2 , 1 mM DTT, 100 ug/ml BSA, 20 units of 
Hind III and 10 units of Bgl II. This solution was 
maintained at 37C for one hour. The efficiency of this 
restriction endonuclease digestion was determined by gel 
electrophoresis according to the methods described in 
Current Protocols in Molecular Biology , Ausubel et al., 
eds., John Wiley and Sons, NY (1987). 

One half of the polymerase chain reaction 
product was digested with the restriction endonucleases 
Sau 3A and Bgl II. Briefly, the DNA was admixed with a 
buffer containing 100 mM NaCl, 10 mM Tris-HCl at pH 7.7, 
10 mM MgCl 2 , 1 mM DTT, 100 ug/ml bovine serum albumin 
(BSA) , 10 units of Sau 3A, and 10 units of Bgl II 
(Stratagene, La Jolla, CA) . This solution was maintained 
at 37C for one hour. The efficiency of this restriction 
endonuclesis digestion was determined by gel 
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electrophoresis . 

The resulting predominant, approximately 500 
base pair bands were isolated and purified on agarose 
gels according to the procedures described in Molecular 

5 Cloning: ft Laboratory Manual, Maniatis et al., eds., 

Cold Spring Harbor, NY (1982). The DNA was purified from 
the agarose slices by electro-elution according to the 
methods described in Molecular Cloning: A Laboratory 
Manual . Maniatis et al., eds., Cold Spring Harbor, NY 

10 (1982) . The electro-eluted DNA was purified by phenol 
extraction followed by ethanol precipitation. 

The mutant S gene was inserted into pBluescript 
KS+ (Stratagene) that had been previously digested with 
the restriction endonuclease Hind III and BamH I. 

15 Briefly, the pBluescript KS+ was admixed with a buffer 
containing 100 mM Nacl, 10 mM Tris-HCl at pH 7.7, 10 mM 
Mg Cl 2 , 1 mM DTT, 100 ug/ml BSA, 40 units of BamH I and 40 
units of Hind III (Stratagene) . This solution was 
maintained at 37C for one hour. The pBluescript KS+ 

20 containing solution was then adjusted to pH 8.0 by adding 
Tris-HCl at pH 8.0 to a final concentration of 0.1 M. 
Five units of calf intestinal alkaline phosphatase 
(Stratagene) was added to this solution and the solution 
maintained at 37C for 30 minutes. The calf intestine 

25 alkaline phosphatase was then inactivated by maintaining 
the solution at 65C for 10 minutes. The pBluescript KS+ 
was then purified by phenol extraction followed by 
ethanol precipitation. The restriction endonuclease 
cleaned pBluescript KS+ was then re-suspended in a 

30 solution containing 10 mM Tris-HCl at pH 8.0 and 1 mM 
EDTA. 

The mutant S gene was inserted (ligated) into 
the pBluescript vector prepared above by digestion with 
Hind III and BamH I restriction endonuclease. Briefly, 1 
35 Ml of pBluescript vector that had been previously cut 
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with Hind III and BamH I was admixed with 1 jil of the 
mutant S gene insert prepared above, 1 jil of a buffer 
containing 0.66 M Tris-HCl at pH 7.6, 50 mM MgCl 2 , 50 mM 
Dithiothreitol (DTT) and 1 pi of a solution containing 10 
5 mM ATP and 0.5 pi (4 units) of T4 DNA ligase 

(Stratagene) . This solution was maintained at 37C for 
one hour. 

The ligation mixture was transformed in XL1 
Blue cells (Stratagene) according to the manufacture's 
10 directions. 

The accuracy of the above cloning steps is 
confirmed by DNA sequencing. 



B. Selectable V H -Expression Vector 

15 Construction 

To add the ability to select against expression 
vectors not containing v H -coding DNA homologs, a 
suppressor tRNA gene was inserted into the V H -Expression 
vector prepared in Example 9. The selectable V H - 

20 Expression vector was prepared by inserting a synthetic 
DNA sequence containing the suppressor tRNA gene and DNA 
sequence coding for the decapeptide tag into the VH- 
Expression vector prepared in Example 9 that had been 
previously cleaved with the restriction endonucleases Xho 

25 I and Eco RI. 

A synthetic DNA sequence containing the 
suppressor tRNA gene and polynucleotide sequence coding 
for decapeptide tag was constructed by designing single 
stranded polynucleotide segments of 20-40 bases that 

30 would hybridize to each other and form the double 

stranded synthetic DNA sequence shown in Figure 18 A. The 
individual single-stranded polynucleotides are shown in 
Table 5. 



WO 90/14430 



98 



PCT/US90/02836 



CQ 
< 




J I 



Eh 
O 

a 

cd cd 



u 

CD cn 



CD < 

a cd 



i 



a 
i 



i 

u 
a 
a 

o 
i 



CO 

I 

cd 

Eh 
U 
En 
Eh 



n 

i 

cd 

CD 
CD 

a 

CD 



a o 

g 

a 

< cd 

cd u 

cd u 

u o 

U EH 



cd 
a 

2 



Eh 

cd 



cd 



a cd 

- < a 

m cd cd 
i u 



mtninininininmm in* in 



3 

cd 
u 

EH 
Eh 

. . . f 

m in in m 



Eh 
O 

cd cd 

< a 

cd u 

a u 

CD 
CD 



Eh 

a cd 

*5 -Eh 
CD U 
1 I 



in . vo 

H . H 
vo VO 
PS « 



*o co cn o 
cn cn cn cn r> 
cn cn cn cn cn 



rH o H cn n rr 
cn I s * 

cn cn cn cn cn cn cn 



cn 

m cm 



CN 

CO CQ 
< < 



in 



o 

H 



in 

rH 



o 

CN 



WO 90/14430 



PCT/US90/02836 



99 

Polynucleotides 926, 927, 928, 929, 930, 931, 
AB23 and 971 were kinased by adding 1.0 Ml of each 
polynucleotide (0.1 ug/Ml) and 20 units of T 4 
polynucleotide kinase to a solution containing 70 mM 
5 Tris-HCl at pH 7.6, 10 mM MgCl 2 , 5 mM DTT, 10 mM 2 ME, 500 
micrograms per ml of BSA. The solution was maintained at 
37C for 30 minutes and the reaction stopped by 
maintaining the solution at 65C for 10 minutes. 

The required polynucleotides were annealed to 
10 form the synthetic DNA sequence shown in Figure 18 A. 

Briefly, the following solutions of polynucleotides were 
admixed to 1/10 volume of a solution containing 20.0 mM 
Tris HC1 at pH 7.4, 2.0 mM MgCl 2 and 50.0 mM NaCl; 5 Ml of 
separate, 2.5 ug/ml solutions containing the kinased 
15 polynucleotides 926, 927, 928, 929, 930 and 931; 4 Ml of 

separate, 2.0 ug/ml solutions containing the unkinased 
polynucleotide AB24 and the kinased polynucleotide AB23; 
2 Ml of separate, 1.0 ug/ml solutions containing the 
kinased polynucleotide 971, and the unkinased 
polynucleotide 970. 

This solution was heated to 70C for 5 minutes 
and allowed to cool to 40C over 1.5 hours in a 500 ml 
beaker of water. During this time period all 10 
polynucleotides annealed to form the double stranded 
synthetic DNA insert shown in Figure 18A. The individual 
polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert by admixing all of the 
above reaction (46.6 Ml) to a solution containing 50 mM 
Tris-HCl at pH 7.5, 7 mM MgCl 2 , 1 mM DDT, 1 mM adenosine 
triphosphate (ATP) and 10 units of T4 DNA ligase to form 
a ligation reaction admixture. This admixture was 
maintained at 37C for 1 hour and then the T4 DNA ligase 
was inactivated by maintaining the solution at 65C for 15 
minutes. The end polynucleotides were kinased by 
admixing all of the above ligation reaction admixture 
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reaction, 6 /il of a solution containing 10 mM ATP and 5 
units of T4 polynucleotide kinase. This solution was 
maintained at 37C for 30 minutes and then the T4 
polynucleotide kinase was inactivated by maintaining the 
5 solution at 65C for 10 minutes. The completed synthetic 
DNA insert (Figure 18 A) was ready for ligation to the V H - 
Expression vector (Figure 7) that had been previously 
digested with the restriction endonucleases Xho I and Eco 
RI. 

10 The V H -Expression vector (Figure 7) was digested 

with the restriction endonucleases Xho I and Eco RI, 
according to the manufacturers recommendations. Briefly, 
50 ug the V H -Expression vector (38.5 /il) , 225 units of Xho 
I (Stratagene) and 150 units of Eco RI (Stratagene) , were 
15 admixed to a universal restriction endonuclease buffer 

consisting of 50 mM Tris-HCl at pH 7.7, 10 mM MgCl 2 , 50 mM 
NaCl and 100 ug/ml BSA to form a digestion admixture. 
The digestion admixture was maintained at 37C for 2 
hours. 

The digestion admixture was then adjusted to pH 
8.0 by adding a solution of 1.0 m Tris-HCl at pH 8.0 to a 
final concentration of 0.1 M. 2.5 units of calf 
intestine alkaline phosphatase (Stratagene) was added to 
this solution and the resulting solution maintained at 
37C for 30 minutes. The calf intestine alkaline 
phosphatase was inactivated by maintaining the solution 
at 65C for 10 minutes. The V H -Expression vector DNA was 
then purified by phenol extraction followed by ethanol 
precipitation. The restriction endonuclease cleaved VH- 
Expression vector DNA was then re-suspended in 50 /il of a 
solution containing 10 mM Tris-HCl at pH 8.0 and 1 mM 
EDTA. 

The synthetic DNA insert prepared above was 
inserted into the restriction endonuclease cleaved V H - 
Expression vector. Briefly, 1 ug of Xho I and Eco RI 
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cleaved V H -Expression vector, 2 /il of synthetic DNA insert 
(0.5 ug) and 0.5 Ml (4 units) of T4 DNA ligase 
(Stratagene) was admixed to a solution containing 66 mM 
Tris-HCl at pH 7.6, 5.0 mM MgCl 2/ 5.0 mM DTT and 1.0 mM 
5 ATP to form a ligation admixture. The ligation admixture 
was maintained at 37C for 2 hours. The ligation mixture 
was then packaged according to the manufacturer's 
instructions using Gigapack II Gold packing extract 
available from Stratagene. The packaged ligation mixture 

10 was then plated on XL1 blue cells (Stratagene) . 

Individual lambda phage plaques were selected 
for DNA sequencing by selecting individual plaques that 
hybridized to polynucleotides contained in the synthetic 
DNA insert according to the method described in Current 

15 Protocols i n Molecular Biology, Ausubel et al., eds., 

John Wiley and Sons, NY (1987). The selectable V„ 
expression vector is shown in Figure 19A. 

C Selectable V { -Exp ression Vector 

20 Construction 

To add the ability to select against expression 
vectors not containing V L -coding DNA homologs, a 
suppressor tRNA gene was inserted into the V L -Expression 
vector prepared in Example 11. The selectable V L - 

25 Expression vector was prepared by inserting a synthetic 

DNA sequence containing the suppressor tRNA gene into the 
V L -Expression vector prepared in Example 11 that had been 
previously cleaved with the restriction endonucleases Sac 
I and Xba I. 

30 A synthetic DNA sequence containing the 

suppressor tRNA gene was constructed by designing single 
stranded polynucleotide segments of 20-40 bases that 
would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 18B. The 

35 individual single-stranded polynucleotides are shown in 
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Table 5. 

Polynucleotides 926 , 927, 928, 929, 930, 972 
and 975 were kinased by adding 1.0 fil of each 
polynucleotide (0.1 ug//il) and 20 units of T 4 
5 polynucleotide kinase to a solution containing 70 mM 

Tris-HCl at pH 7.6, 10 mM MgCl 2 , 5 mM DTT, 10 mM 2 ME, 100 
micrograms per ml of BSA. The solution was maintained at 
37C for 30 minutes and the reaction stopped by 
maintaining the solution at 65C for 10 minutes. 

10 The required polynucleotides were annealed to 

form the synthetic DNA sequence shown in Figure 18B. 
Briefly, the following solutions of polynucleotides were 
admixed to 1/10 volume of a solution containing 20.0 mM 
Tris HC1 at pH 7.4, 2.0 mM MgCl 2 and 50.0 mM NaCl; 5 pi of 

15 separate, 2.5 ug/ml solutions containing the kinased 

polynucleotides 926, 927, 928, 929, 930 and 931; 2 /xl of 
separate, 2.0 ug/ml solutions containing the unkinased 
polynucleotides 974 and 973, and the kinased 
polynucleotides 972 and 975. 

20 This solution was heated to 70C for 5 minutes 

and allowed to cool to 40C over 1.5 hours in a 500 ml 
beaker of water. During this time period all 10 
polynucleotides annealed to form the double stranded 
synthetic DNA insert shown in Figure 18 B. The individual 

25 polynucleotides were covalently linked to each other to 

stabilize the synthetic DNA insert by admixing all of the 
above reaction (42.2 fil) to a solution containing 50 mM 
Tris-HCl at pH 7.5, 7 mM MgCl 2 , 1 mM DDT, 1 mM adenosine 
triphosphate (ATP) and 10 units of T4 DNA ligase to form 

30 a ligation reaction admixture. This admixture was 

maintained at 37C for 1 hour and then the T4 DNA ligase 
was inactivated by maintaining the solution at 65C for 15 
minutes. The end polynucleotides were kinased by 
admixing all of the above ligation reaction admixture 

35 reaction, 6 fil of a solution containing 10 mM ATP and 5 
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units of T4 polynucleotide kinase* This solution was 
maintained at 37C for 30 minutes and then the T4 
polynucleotide kinase was inactivated by maintaining the 
solution at 65C for 10 minutes, the completed synthetic 
5 DNA insert (Figure 18B) was ready for ligation to the V L - 
Expression vector (Figure 9) that had been previously 
digested with the restriction endonucleases Sac I and Xba 
I. 

The V L -Expression vector (Figure 9) was digested 
10 with the restriction endonucleases Sac I and Xba I, 
according to the manufacturer's recommendations. 
Briefly, 5.0 ug the V L -Expression vector (30.5 /il) , 50 
units of Sac I (Stratagene) and 50 units of Xba I 
(Stratagene) , were admixed to a universal restriction 

15 endonuclease buffer consisting of 10 mM Tris-HCl at pH 

7.7, 10 mM MgCl 2/ 100 mM NaCl and 100 ug/ml BSA to form a 
digestion admixture. The digestion admixture was 
maintained at 37C for 2 hours. 

The digestion admixture was then adjusted to pH 

20 8.0 by adding a solution of 1.0 m Tris-HCl at pH 8.0 to a 
final concentration of 0.1M. 2.5 units of calf intestine 
alkaline phosphatase (Stratagene) was added to this 
solution and the resulting solution maintained at 37C for 
3 0 minutes. The calf intestine alkaline phosphatase was 

25 inactivated by maintaining the solution at 65C for 10 

minutes. The V L -Expression vector DNA was then purified 
by phenol extraction followed by ethanol precipitation. 
The restriction endonuclease cleaved V L -Expression vector 
DNA was the re-suspended in 50 til of a solution 

30 containing 10 mM Tris-HCl at pH 8.0 and 1 mM EDTA. 

The synthetic DNA insert prepared above was 
inserted into the restriction endonuclease cleaved VL- 
Expression vector. Briefly, 1 ug of Sac I and Xba I 
cleaved V L -Expression vector, 2 /il of synthetic DNA insert 

35 (0.5 ug) and 0.5 m! (4 units) of T4 DNA ligase 
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(Stratagene) was admixed to a solution containing 66 mM 
Tris-HCl at pH 7.6, 5.0 mM MgCl 2 , 5.0 mM DTT and 1.0 mM 
ATP to form a ligation admixture. The ligation admixture 
was maintained at 37C for 2 hours. The ligation mixture 
5 was packaged according to the manufacturer's instructions 
using Gigapack II Gold packing extract available from 
Stratagene. The packaged ligation mixture was plated on 
XLl blue cells (Stratagene) . 

Individual lambda phage plagues were selected 

10 for DNA sequencing by screening for plaques that 

hybridized to polynucleotides contained in the synthetic 
DNA insert according to the methods described in 
Molecular Cloning; A Laboratory Manual , Maniatis et al . , 
eds., Cold Spring Harbor, NY (1989). The selectable V t 

15 expression vector is shown in Figure 19B. 

D. Construction of a Selectable V t and 
V^z Expression Vector 
The V H -Expression vector prepared in Example 17B 
is modified so that it does not contain any Xho I or Spe 
I restriction endonuclease sites. This modification of 
this vector is accomplished using a set of 
polynucleotides and methods similar to the methods 
described in Example 17B. 

The V L -Expression vector prepared in Example 17C 
is modified so that it does not contain any Sac I or Xba 
I restriction endonuclease sites. This modification of 
the V L -Expression vector is accomplished using a set of 
polynucleotides and methods well known in the art and 
similar to the methods described in Example 17C. The 
modified V L -Expression vector and the modified VH- 
Expression vector are combined to produce a selectable V L 
and V H Expression vector. Briefly, the modified VH- 
Expression vector is digested with the restriction 
endonucleases Eco RI and Hind III using the conditions 
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recommended by the enzyme manufacturer and is digested 
with the restriction endonucleases Eco RI and Mlu I. The 
restriction endonuclease cleaved V H and V L Expression 
vectors are the ligated together using standard 
5 techniques to form the selectable V H and V t Expression 
vector shown in Figure 20 . 

The V H and V L Expression vector contains 2 
suppressor tRNA genes, one is replaced by the V H DNA 
homolog and the other is replaced by the V L DNA homolog. 
10 Therefore, when the vector contains both a V H and a V L DNA 
homolog, the vector does not contain a suppressor tRNA 
gene allowing the V H and V L containing vector to produce 
phage plaques under the appropriate selection conditions, 

15 

E. Inserting DNA Homoloas into the 

Selectable DNA Expression Vectors 
V H coding and/or V L coding DNA homologs prepared 
in Example 5 are inserted into the V H and V L expression 

20 vector, the V H expression vector, or the V L expression 

vector using the provided restriction endonuclease sites. 
The V H coding DNA homologs are typically inserted into the 
provided Xho I and Spe I restriction endonuclease sites 
(Figure 20) using standard procedures. The V L coding DNA 

25 homologs are typically inserted into the provided 

restriction endonuclease sites (Figure 20) . Therefore, 
depending on the particular expression vector selected, 
the methods described herein produce an expression vector 
containing a V H coding DNA homolog alone, a V L coding DNA 

30 homolog alone, or a V H and a V L DNA homolog. 

The V H coding DNA homologs may be inserted into 
the expression vector first, followed by the V L DNA 
homologs. Alternatively, the V L coding homologs may be 
inserted first followed by the V H coding homologs. Either 

35 insertion order allows the random recombination of a 
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library of V H coding DNA homologs with a library V L coding 
DNA homologs. After the V H homologs have been inserted 
into the V H + V L expression vector, the expression vector 
can be grown to produce more of the V H containing 
5 expression vector. The V L coding DNA homologs can then be 
inserted into the V H and V L expression vector. Any of 
these procedures will allow the production of a large 
combinatorial library. 

10 F. Selection of V H and/or V { DNA Homoloq 

Containing Phage 
A strong selection system is employed in order 
to reduce the number of expression vectors present in the 
final library that do not contain V H and/or V L DNA 

15 homologs. The selection system combines the dominant 

Lambda S gene mutation with the suppressor tRNA that is 
present in V H and/or V L expression vectors. When the 
suppressor tRNA is present in the expression vector , the 
mutant Lambda S protein is produced preventing the lysis 

20 of the infected cell and thereby preventing the formation 
of a phage plaque. When a DNA homolog replaces the 
suppressor tRNA, the expression vector can produce a 
phage plaque. In order to detect a V H and/or V L the V H 
and/or V L expression vector must produce a phage plaque 

25 because without plaque production there is not enough V H 
and V L expressed to detect using either immunologic or 
binding assays. Therefore, phages not containing a V H 
and/or V L will not be detected. To accomplish this 

selection, appropriate host bacterial cells containing 

30 the mutant S gene plasmid produced in Example 17A are 
infected with the desired expression vector library. 
Only the expression vectors without suppressor tRNA 
genes, the expression vectors containing DNA homologs, 
produce phage plaques. 



35 



WO 90/14430 



PCT/US90/02836 



107 

18. Generation of a Large Combinatorial 

T.ibrarv of the I mmunoglobulin Repertoire 
in Phage 

Vectors suitable for expression of V Hf V L/ Fv 
and Fab sequences are diagrammed in Figures 7 and 9. As 
previously discussed, the vectors were constructed by 
modification of Lambda Zap by inserting synthetic 
oligonucleotides into the multiple cloning site. The 
vectors were designed to be antisymmetric with respect to 
the Not I and EcoR I restriction sites which flank the 
cloning and expression sequences. As described below, 
this antisymmetry in the placement of restriction sites 
in a linear vector like bacteriophage is the essential 
feature of the system which allows a library expressing 
light chains to be combined with one expressing heavy 
chains to construct combinatorial Fab expression 
libraries. Lambda Zap II V L II (Figure 9) is designed to 
serve as a cloning vector for light chain fragments and 
Lambda Zap II V H (Figure 7) is designed to serve as a 
cloning vector for heavy chain sequences in the initial 
step of library construction. These vectors are 
engineered to efficiently clone the products of PCR 
amplification with specific restriction sites 
incorporated at each end. 

A. PCR Amplif ication of Antibody 
Fragments 

The PCR amplification of mRNA isolated from 
spleen cells with oligonucleotides which incorporate 
restriction sites into the ends of the amplified product 
can be used to clone and express heavy chain sequences 
including Fd and kappa chain sequences. The 
oligonucleotide primers used for these amplifications are 
presented in Tables 1 and 2. The primers are analogous 
to those which have been successfully used in Example 5 
for amplification of V H sequences. The set of 5 1 primers 
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for heavy chain amplification were identical to those 
previously used to amplify V H and those for light chain 
amplification were chosen on similar principles, Sastry 
et al., Proc. Natl. Acad. Sci. USA . 8G: 5728 (1989) and 
5 Orlandi et al., Proc. Natl. Acad. Sci. USA r 8G:3833 

(1989). The unique 3 f primers of heavy (IgGl) and light 
(Jc) chain sequences were chosen to include the cysteines 
involved in heavy-light chain disulfide bond formation. 
At this stage no primer was constructed to amplify lambda 

10 light chains since they constitute only a small fraction 
of murine antibodies. In addition, Fv fragments have 
been constructed using a 3 f primer which is complementary 
to the mRNA in the J (joining) region (amino acid 128) 
and a set of unique 5 1 primers which are complementary to 

15 the first strand cDNA in the conserved N-terminal region 
of the processed protein. Restriction endonuclease 
recognition sequences are incorporated into the primers 
to allow for the cloning of the amplified fragment into a 
lambda phage vector in a predetermined reading frame for 

20 expression. 

B. Library Construction 
The construction of a combinatorial library was 
accomplished in two steps. In the first step, separate 
heavy and light chain libraries were constructed in 
25 Lambda Zap II V H and Lambda Zap II V L II respectively. In 
the second step, these two libraries were combined at the 
antisymmetric EcoRl sites present in each vector. This 
resulted in a library of clones each of which potentially 
co-expresses a heavy and a light chain. The actual 
combinations are random and do not necessarily reflect 
the combinations present in the B-cell population in the 
parent animal. Lambda Zap II V H expression vector has 
been used to create a library of heavy chain sequences 
from DNA obtained by PCR amplification of mRNA isolated 
from the spleen of a 129 G ix + mouse previously immunized 
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with p-nitrophenyl phosphonamidate (NPN) antigen 1 
according to formula I (Figure 13) conjugated to keyhole 
limpet hemocyanin (KLH) . The NPN-KLH conjugate was 

prepared by admixture of 250 til of a solution containing 
5 2,5 mg of NPN according to formula 1 (Figure 13) in 

dimethyl formamide with 750 pi of a solution containing 2 
mg of KLH in 0.01 M sodium phosphate buffer (pH 7.2). 
The two solutions were admixed by slow addition of the 
NPN solution to the KLH solution while the KLH solution 
10 was being agitated by a rotating stirring bar. 

Thereafter the admixture was maintained at 4° for 1 hour 
with the same agitation to allow conjugation to proceed. 
The conjugated NPN-KLH was isolated from the 
nonconjugated NPN and KLH by gel filtration through 
15 Sephadex G-25. The isolated NPN-KLH conjugate was used 

in mouse immunizations as described in Example 2. 

The spleen mRNA resulting from the above 
immunizations was isolated and used to create a primary 
library of V H gene sequences using the Lambda Zap II V H 
expression vector. The primary library contains 1.3 x 10 6 
pfu and has been screened for the expression of the 
decapeptide tag to determine the percentage of clones 
expressing Fd sequences. The sequence for this peptide 
is only in frame for expression following the cloning of 
a Fd (or V H ) fragment into the vector. At least 80% of 
the clones in the library express Fd fragments based on 
immune-detection of the decapeptide tag. 

The light chain library was constructed in the 
same way as the heavy chain and shown to contain 2.5 x 10 6 
members. Plaque screening, using an anti-kappa chain 
antibody, indicated that 60% of the library contained 
expressed light chain inserts. This relatively small 
percentage of inserts probably resulted from incomplete 
dephosphorylation of vector after cleavage with Sac I and 
Xba I. 
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Once obtained, the two libraries were used to 
construct a combinatorial library by crossing them at the 
EcoR I site. To accomplish the cross, DNA was first 
purified from each library. The light chain library was 
5 cleaved with Mlul restriction endonuclease, the resulting 
5' ends dephosphorylated and the product digested with 
EcoR I- This process cleaved the left arm of the vector 
into several pieces but the right arm containing the 
light chain sequences, remained intact. In a parallel 
10 fashion, the DNA of heavy chain library was cleaved with 
Hindlll, dephosphorylated and cleaved with EcoR I, 
destroying the right arm but leaving the left arm 
containing the heavy chain sequences intact. The DNA 1 s 
so prepared were then combined and ligated. After 
15 ligation only clones which resulted from combination of a 
right arm of light chain-containing clones and a left arm 
of heavy chain-containing clones reconstituted a viable 
phage. After ligation and packaging, 2.5 x 10 7 clones 
were obtained. This is the combinatorial Fab expression 
20 library that was screened to identify clones having 

affinity for NPN. To determine the frequency the phage 
clones which co-express the light and heavy chain 
fragments, duplicate lifts of the light chain, heavy 
chain and combinatorial libraries were screened as above 
25 for light and heavy chain expression. In this study of 

approximately 500 recombinant phage approximately 60% co- 
expressed light and heavy chain proteins. 

C. Antigen Binding 
All three libraries, the light chain, the heavy 
chain and Fab were screened to determine if they 
contained recombinant phage that expressed antibody 
fragments binding NPN. In a typical procedure 30,000 
phage were plated and duplicate lifts with nitrocellulose 
screened for binding to NPN coupled to 125 I labeled BSA 
(Figure 15). Duplicate screens of 80,000 recombinant 
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phage from the light chain library and a similar number 
from the heavy chain library did not identify any clones 
which bound the antigen. In contrast, the screen of a 
? similar number of clones from the Fab expression library 

5 identified many phage plagues that bound NPN (Figure 15) . 
This observation indicates that under conditions where 
many heavy chains in combination with light chains bind 
to antigen the same heavy or light chains alone do not. 
Therefore, in the case of NPN, it is believed that there 
10 are many heavy and light chains that only bind antigen 
when they are combined with specific light and heavy 
chains respectively. 

To assess the ability to screen large numbers 
of clones and obtain a more quantitative estimate of the 
15 frequency of antigen binding clones in the combinatorial 

library, one million phage plaques were screened and 
approximately 100 clones which bound to antigen were 
identified. For six clones which were believed to bind 
NPN, a region of the plate containing the positive and 
20 approximately 20 surrounding bacteriophage plaques was 
"cored", replated, and screened with duplicate lifts 
(Figure 15) . As expected, approximately one in twenty of 
the phage specifically bind to antigen. "Cores" of 
regions of the plated phage believed to be negative did 
25 not give positives on replating. 

To determine the specificity of the antigen- 
antibody interaction, antigen binding was competed with 
free unlabeled antigen as shown in Figure 16. 
Competition studies showed that individual clones could 
30 be distinguished on the basis of antigen affinity. The 
concentration of free antigen required for complete 
• inhibition of binding varied between 10-100 x 10 9 M 

suggesting that the expressed Fab fragments had binding 
constants in the nanomolar range. 
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D. Composition of the Clorm fi and Their 
Expressed Products 
In preparation for characterization of the 
protein products able to bind NPN as described in Example 
5 18C, a plasmid containing the heavy and light chain genes 
was excised from the appropriate "cored" bacteriophage 
plaque using M13mp8 helper phage. Mapping of the excised 
plasmid demonstrated a restriction pattern consistent 
with incorporation of heavy and light chain sequences. 

10 The protein products of one of the clones was analyzed by 
ELISA and Western blotting to establish the composition 
of the NPN binding protein. A bacterial supernate 
following IPTG induction was concentrated and subjected 
to gel filtration. Fractions in the molecular weight 

15 range 40-60 kD were pooled, concentrated and subjected to 
a further gel filtration separation. As illustrated in 
Figure 17, ELISA analysis of the eluting fractions 
demonstrated that NPN binding was associated with a 
protein of molecular weight about 50 kD which 

20 immunological detection showed contained both heavy and 
light chains. A Western blot (not shown) of a 
concentrated bacterial supernate preparation under non- 
reducing conditions was developed with anti-decapeptide 
antibody. This revealed a protein band of molecular 

25 weight of 50 kD. Taken together these results are 
consistent with NPN binding being a function of Fab 
fragments in which heavy and light chains are covalently 
linked. 

E. Comparison of the Properties of the In 
30 Vivo Repertoire Versus the Phage 

Combinatorial Library 
In this example a relatively restricted library 
was prepared because only a limited number of primers 
were used for PCR amplification of Fd sequences. The 
35 library is expected to contain only clones expressing 
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kappa/gammal sequences. However, this is not an inherent 
limitation of the method since additional primers can be 
added to amplify any antibody class or subclass. Despite 
this restriction we were able to isolate a large number 

5 of antigen binding clones. 

A central issue arising from this work is how a 
phage library prepared as described herein compares with 
the in vivo antibody repertoire in terms of size, 
characteristics of diversity, and ease of access. 

0 The size of the mammalian antibody repertoire 

is difficult to judge but a figure of the order of 10 6 -10 8 
different antigen specificities is often quoted. With 
some of the reservations discussed below, a phage library 
of this size or larger can readily be constructed by a 

5 modification of the current method. In fact once an 

initial combinatorial library has been constructed, heavy 
and light chains can be shuffled to obtain libraries of 
exceptionally large numbers. 

In principle, the diversity characteristics of 

0 the naive (unimmunized) jji vivo repertoire and 

corresponding phage library are expected to be similar in 
that both involve a random combination of heavy and light 
chains. However, different factors will act to restrict 
the diversity expressed by an in vivo repertoire and 

5 phage library. For example a physiological modification 
such as tolerance will restrict the expression of certain 
antigenic specificities from the in vivo repertoire but 
these specificities may still appear in the phage 
library. On the other hand, bias in the cloning process 

) may introduce restrictions into the diversity of the 

phage library. For example the representation of mRNA 
for sequences expressed by stimulated B-cells can be 
expected to predominate over those of unstimulated cells 
because of higher levels of expression. Different source 

3 tissues (e.g., peripheral blood, bone marrow or regional 
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lymph nodes) and different PCR primers (e.g., ones 
expected to amplify different antibody classes) may 
result in libraries with different diversity 
characteristics . 
5 Another difference between in vivo repertoire 

and phage library is that antibodies isolated from the 
former may have benefited from affinity maturation due to 
somatic mutations after combination of heavy and light 
chains whereas the latter randomly combines the matured 

10 heavy and light chains. Given a large enough phage 

library derived from a particular in vivo repertoire , the 
original matured heavy and light chains will be 
recombined. However, since one of the potential benefits 
of this new technology is to obviate the need for 

15 immunization by the generation of a single highly diverse 
"generic" phage library, it would be useful to have 
methods to optimize sequences to compensate for the 
absence of somatic mutation and clonal selection. Three 
procedures are made readily available through the methods 

20 of the present invention. First, saturation mutagenesis 
may be performed on the CDR's and the resulting Fabs can 
be assayed for increased function. Second, a heavy or a 
light chain of a clone which binds antigen can be 
recombined with the entire light or heavy chain libraries 

25 respectively in a procedure identical to the one used to 
construct the combinatorial library. Third, iterative 
cycles of the two above procedures can be performed to 
further optimize the affinity or catalytic properties of 
the immunoglobulin. It should be noted that the latter 

30 two procedures are not permitted in B-cell clonal 

selection which suggests that the methods described here 
may actually increase the ability to identify optimal 
sequences . 

Access is the third area where it is of 
35 interest to compare the in vivo antibody repertoire and 
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phage library. In practical terms the phage library is 
much easier to access. The screening methods allow one 
to survey at least 50,000 clones per plate so that 10 6 
antibodies can be readily examined in a day. This factor 
5 alone should encourage the replacement of hybridoma 

technology with the methods described here. The most 
powerful screening methods utilize selection which may be 
accomplished by incorporating selectable markers into the 
antigen such as leaving groups necessary for replication 
10 of auxotrophic bacterial strains or toxic substituents 

susceptible to catalytic inactivation. There are also 
further advantages related to the fact that the in vivo 
antibody repertoire can only be accessed via immunization 
which is a selection on the basis of binding affinity. 
15 The phage library is not similarly restricted. For 

example, the only general method to identify antibodies 
with catalytic properties has been by pre-selection on 
the basis of affinity of the antibody to a transition 
state analogue. No such restrictions apply to the in 
20 vitro library where catalysis can, in principle, be 

assayed directly. The ability to directly assay large 
numbers of antibodies for function may allow selection 
for catalysts in reactions where a mechanism is not well 
defined or synthesis of the transition state analog is 
25 difficult. Assaying for catalysis directly eliminates 
the bias of the screening procedure for reaction 
mechanisms pejorative to a synthetic analog and therefore 
simultaneous exploration of multiple reaction pathways 
for a given chemical transformation are possible. 
30 The methods disclosed herein describe 

generation of Fab fragments which are clearly different 
in a number of important respects from intact (whole) 
antibodies. There is undoubtedly a loss of affinity in 
having monovalent Fab antigen binders but this can be 
35 compensated by selection of suitably tight binders. For 
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a n umb er of applications such as diagnostics and 
biosensors it may be preferable to have monovalent Fab 
fragments. For applications requiring Fc effector 
functions, the technology already exists for extending 
5 the heavy chain gene and expressing the glycosylated 
whole antibody in mammalian cells. 

The ideas presented here address the bottle 
neck in the identification and evaluation of antibodies. 
It is now possible to construct and screen at least three 
10 orders of magnitude more clones with mono-specificity 

than previously possible. The potential applications of 
the method should span basic research and applied 
sciences. 

The foregoing is intended as illustrative of 
15 the present invention but not limiting. Numerous 

variations and modifications can be effected without 
departing from the true spirit and scope of the 
invention. 
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What Is Claimed Is: 

1. A method of producing a conserved 
receptor-coding nucleic acid, which method 
comprises : 

(a) synthesizing a conserved 
receptor-coding gene library containing a plurality 
of different receptor-coding DNA homologs by: 

(i) separating the strands of 
a repertoire of conserved receptor-coding genes, 
said repertoire comprising double-stranded nucleic 
acids each containing a receptor-coding strand 
annealed to a complementary strand; 

(ii) treating said separated 
strands, under conditions suitable for polymerase 
chain reaction amplification, with first and second 
polynucleotide synthesis primers, each of said 
first primers having a nucleotide sequence capable 
of hybridizing to a sequence conserved among said 
receptor-coding strands, and each of said second 
primers having a nucleotide sequence capable of 
hybridizing to a sequence conserved among said 
complementary strands, said primers being capable 
of priming the amplification of a plurality of 
different receptor-coding DNA homologs from said 
receptor-coding gene repertoire, said treating 
producing said conserved receptor-coding gene 
library. 

2. The method of claim 1 wherein said 
conserved receptor-coding nucleic acid codes for a 
V H , said conserved receptor-coding genes are V H - 
coding genes, and said receptor-coding DNA homologs 
are V H -coding DNA homologs. 

3. The method of claim 2 wherein said 
first polynucleotide synthesis primer hybridizes to 
an immunoglobulin J H or framework region nucleotide 
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sequence. 

4. The method of claim 2 wherein said 
second polynucleotide synthesis primer hybridizes 
to a framework, leader or promoter region of a V H 
immunoglobulin gene. 

5. The method of claim 2 further 
comprising segregating from said V H -coding library a 
V H -coding DNA homolog that codes for a receptor of 
predetermined specificity. 

6. The method of claim 5 wherein said 
segregating comprises: 

(a) operatively linking for 
expression each of a plurality of said different V H - 
coding DNA homologs to an expression vector, 
thereby forming a plurality of different V H - 
expression vectors; 

(b) transforming a population of 
host cells compatible with said expression vector 
with a plurality of said different V H -expression 
vectors to produce a transformed population of host 
cells whose members contain said V H -expression 
vectors ; 

(c) culturing said transformed 
population under conditions for expressing the 
receptors coded for by said V H -coding DNA homologs; 

(d) assaying the members of said 
transformed population for expression of a receptor 
capable of binding said preselected ligand, thereby 
identifying transformants containing said V H -coding 
DNA homolog; and 

(e) segregating an identified 
transformant of step (d) from said population, 
thereby producing said conserved V H -coding nucleic 
acid. 

7. The method of claim 5 wherein said 
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isolated gene codes for a catalytic receptor. 

8. The method of claim 6 wherein said 
host cells express a V L molecule, and said 
identified transformants express a F v that binds 

5 said preselected ligand. 

9, The method of claim 5 wherein all of 
the members of said population of host cells 
express the same preselected V L# and said identified 
transformants express a F v that binds said 

10 preselected ligand. 

10. The method of claim 5 wherein said 
receptor contains a preselected epitope coded for 
by either of said primers or said expression 
vector. 

15 11. The method of claim 5 wherein said 

expression vector is an episome, phage or plasmid 
comprised of a selectable marker gene. 

12. The method of claim 1 wherein said 
conserved receptor-coding nucleic acid codes for a 

20 V L/ said conserved receptor-coding genes are V L - 

coding genes, and said receptor-coding DNA homologs 
are V L -coding DNA homologs. 

13. The method of claim 12 further 
comprising segregating from said V L -coding library a 

25 V L -coding DNA homolog that codes for a V L capable of 
modulating the binding affinity of a preselected V H . 

14. The method of claim 13 wherein said 
segregating comprises: 

(a) operatively linking for 

30 expression a portion of the V L -coding DNA homologs 

produced to a vector to form a V L -expression vector; 

(b) transforming a population of 
compatible host cells capable of expressing said 
preselected receptor with a plurality of said V L - 

35 expression vectors; 
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(c) culturing said transformed 
population under conditions for expressing both the 
polypeptide coded for by said V L -coding DNA homolog 
and said preselected receptor to produce a F v ; and 

(d) segregating from said culture a 
transformant producing a F v having a binding 
affinity for a ligand bound by said preselected 
receptor that is different from that of said 
preselected ligand binding polypeptide alone, 
thereby isolating said conserved V H -coding nucleic 
acid. 

15. The method of claim 12 wherein said 
first polynucleotide synthesis primer hybridizes to 
an immunoglobulin J L or framework region nucleotide 
sequence. 

16. The method of claim 12 wherein said 
second polynucleotide synthesis primer hybridizes 
to a framework, leader or promoter region of a V L 
immunoglobulin gene. 

17. The method of claim 12 wherein said 
F v is catalytic. 

18. The method of claim 1 wherein said 
synthesizing is performed using a plurality of 
different first primers. 

19. The method of claim 1 wherein said 
synthesizing is performed using a plurality of 
different second primers. 

20. The method of claim 1 wherein said 
synthesizing is performed using a plurality of 
different first polynucleotide synthesis primers 
and a plurality of different second polynucleotide 
synthesis primers. 

21. The method of claim 1 wherein step 
(a) is performed a plurality of times, each time 
using a different repertoire of conserved receptor- 
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coding genes, and admixing one or more of the 
conserved receptor-coding gene libraries produced 
each time. 

22. The method of claim 6 wherein said 
expression vector molecules are linear DNA 
expression vector molecules. 

23. The method of claim 22 wherein said 
linear DNA expression vector molecules are phage 
vector molecules. 

24. The method of claim 23 wherein said 
lambda phage vector molecules are Lambda Zap II V H 
molecules. 

25. The method of claim 23 further 
including operatively linking a V L -coding gene to 
said phage vector molecules. 

26. The method of claim 25 wherein said 
V H -coding DNA homolog and said V L -coding gene are 
operatively linked to said phage vector molecules 
in an orientation for dicistronic expression. 

27. A method of producing a catalytic 
receptor comprising: 

(a) operatively linking for 
expression a gene isolated according to claim 7 to 

a suitable expression vector to form a V H -expression 
vector; 

(b) transforming a host cell 
compatible with said expression vector to produce a 
transformant; 

(c) culturing said transformant 
under conditions for expressing the catalytic 
receptor coded for by said V H -coding DNA homolog, 
thereby producing said catalytic receptor in said 
culture ; and 

(d) recovering from said culture 
said catalytic receptor. 
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28. The method of claim 27 wherein said 
host cell contains a V L -coding gene that expresses a 
V L capable of modulating the catalytic activity of 
said produced catalytic receptor, and wherein said 

5 produced catalytic receptor is present as a part of 
an F v comprised of said receptor and said V L . 

29. The method of claim 28 wherein said 
isolated gene and said V L -coding gene are 
operatively linked for expression to the same 

10 expression vector. 

30. A method of producing an isolated 
coexpression vector capable of expressing first and 
second polypeptides from respective first and 
second genes , said first and second polypeptides 

15 being capable of forming a heterodimeric receptor 
of predetermined specificity, which method 
comprises: 

(a) synthesizing a first 
polypeptide-coding gene library containing a 
20 plurality of different first polypeptide-coding DNA 
homologs by: 

(i) separating the strands of 
a repertoire of first polypeptide-coding genes, 
said repertoire comprising double-stranded nucleic 

25 acids each containing a first polypeptide-coding 
strand annealed to a complementary strand; 

(ii) treating said separated 
strands, under conditions suitable for polymerase 
chain reaction amplification, with first and second 

30 polynucleotide synthesis primers, each of said 

first primers having a nucleotide sequence capable 
of hybridizing to a sequence conserved among said 
first polypeptide-coding strands, and each of said 
second primers having a nucleotide sequence capable 

35 of hybridizing to a sequence conserved among said 
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complementary strands, said primers being capable 
of priming the amplification of a plurality of 
different first polypeptide-coding DNA homologs 
from said first polypeptide-coding gene repertoire, 
said treating producing said first polypeptide- 
coding gene library; 

(b) synthesizing a second 
polypeptide-coding gene library containing a 
plurality of different second polypeptide-coding 
DNA homologs by: 

(i) separating the strands of 
a repertoire of second polypeptide-coding genes, 
said repertoire comprising double-stranded nucleic 
acids each containing a second polypeptide-coding 
strand annealed to a second complementary strand; 

(ii) treating said separated 
strands, under conditions suitable for polymerase 
chain reaction amplification, with third and fourth 
polynucleotide synthesis primers, each of said 
third primers having a nucleotide seguence capable 
of hybridizing to a sequence conserved among said 
second polypeptide-coding strands, and each of said 
fourth primers having a nucleotide sequence 
corresponding to a sequence conserved among said 
second complementary strands, said primers being 
capable of priming the amplification of a plurality 
of different second polypeptide-coding DNA homologs 
from said second polypeptide-coding gene 
repertoire, said treating producing said second 
polypeptide-coding gene library; 

(c) forming a diverse library of 
coexpression vectors by treating expression vector 
molecules adapted for ligation to the first 
polypeptide- and second polypeptide-coding DNA 
homologs of steps (a) (ii) and (b) (ii) , 
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respectively , with a diverse plurality of said 
first polypep tide-coding DNA homologs and a diverse 
plurality of said second polypeptide-coding DNA 
homologs, under conditions suitable for DNA * 
5 ligation to produce a plurality of different 

coexpression vectors, each of said different * 
coexpression vectors being capable of expressing a 
heterodimeric receptor molecule comprising a 
combination of first and second polypeptides that 

10 is different from the combination of first and 

second polypeptides forming heterodimeric receptor 
molecules expressed by any other of said different 
coexpression vectors; and 

(d) segregating from said diverse 

15 library of coexpression vectors a coexpression 
vector capable of expressing an antibody of 
predetermined specificity. 

31. The method of claim 30 wherein said 
expression vector molecules are linear DNA 

20 expression vector molecules. 

32. The method of claim 31 wherein said 
linear DNA expression vector molecules are phage 
vector molecules. 

33. The method of claim 30 wherein said 
25 first polypeptide is a V H . 

34. The method of claim 33 wherein said 
second polypeptide is a V L . 

35. A method of producing a monoclonal 
antibody of predetermined specificity, which method 

30 comprises: 

(a) synthesizing a V H -coding gene 
library containing a plurality of different V H - * 
coding DNA homologs by: 

(i) separating the strands of ' 
35 a repertoire of V H -coding genes, said repertoire 
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comprising double-stranded nucleic acids each 
containing a V H -coding strand annealed to a 
complementary strand; 

(ii) treating said separated 
5 strands, under conditions suitable for polymerase 

chain reaction amplification, with first and second 
polynucleotide synthesis primers, each of said 
first primers having a nucleotide sequence capable 
of hybridizing to a sequence conserved among said 

10 V H -coding strands, and each of said second primers 
having a nucleotide sequence capable of hybridizing 
to a sequence conserved among said complementary 
strands, said primers being capable of priming the 
amplification of a plurality of different V H -coding 

15 DNA homologs from said V H -coding gene repertoire, 

said treating producing said V H -coding gene library; 

(b) synthesizing a V L -coding gene 
library containing a plurality of different V L - 
coding DNA homologs by: 

20 (i) separating the strands of 

a repertoire of V L -coding genes, said repertoire 
comprising double-stranded nucleic acids each 
containing a V L -coding strand annealed to a 
complementary strand; 

25 (ii) treating said separated 

strands, under conditions suitable for polymerase 
chain reaction amplif ication, with third and fourth 
polynucleotide synthesis primers, each of said 
third primers having a nucleotide sequence capable 

30 of hybridizing to a sequence conserved among said 
V L -coding strands, and each of said fourth primers 
having a nucleotide sequence corresponding to a 
sequence conserved among said complementary 
strands, said primers being capable of priming the 

35 amplification of a plurality of different V L -coding 
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DNA homologs from said V L -coding gene repertoire, 
said heating producing said V L -coding gene library; 

(c) forming a diverse library of 
coexpression vectors by treating expression vector 

5 molecules adapted for ligation to the V H - and V L - 

coding DNA homologs of steps (a) (ii) and (b) (ii) , 
respectively, with a diverse plurality of said V H - 
coding DNA homologs and a diverse plurality of said 
V L -coding DNA homologs, under conditions suitable 

10 for DNA ligation to produce a plurality of 

different coexpression vectors, each of said 
different coexpression vectors being capable of 
expressing an antibody molecule comprising a 
combination of V H and V L polypeptides that is 

15 different from the combination of V H and V L 

polypeptides forming antibody molecules expressed 
by any other of said different coexpression 
vectors ; 

(d) transforming a population of 
20 host cells compatible with said coexpression 

vectors with a plurality of said different 
coexpression vectors to produce a transformed 
population; 

(e) culturing said transformed 
25 population under conditions for expressing the 

antibody molecules coded for by said V H - and VL- 
coding DNA homologs; 

(f) assaying the members of said 
transformed population for expression of an 

30 antibody molecule capable of binding a preselected 
ligand; thereby identifying a transformant capable 
of producing said monoclonal antibody; and 

(g) harvesting from a monoclonal 
culture of said identified transformant of step (f) 

35 the antibody molecules produced by said culture, 
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thereby producing said monoclonal antibody. 

36. The method of claim 35 wherein said 
monoclonal antibody is catalytic. 

37. A method of producing a conserved 
receptor-coding gene library, which method 
comprises: 

(a) synthesizing a plurality of 
different conserved receptor-coding DNA homologs 
by: 

(i) subjecting said conserved 
receptor-coding gene repertoire to a first primer 
extension reaction utilizing a first polynucleotide 
synthesis primer capable of initiating said first 
reaction by hybridizing to a nucleotide sequence 
conserved within said repertoire, thereby producing 
a plurality of different receptor-coding DNA 
homolog compliments, and subjecting said 
compliments to a second primer extension reaction 
utilizing a second polynucleotide synthesis primer 
capable of initiating said second reaction by 
hybridizing to a nucleotide sequence conserved 
among said compliments, thereby producing a 
plurality of different receptor-coding DNA 
homologs, or 

(ii) subjecting a complement 
of a conserved receptor-coding gene repertoire to a 
third primer extension reaction utilizing a third 
polynucleotide synthesis primer capable of 
initiating said third primer extension reaction by 
hybridizing to a nucleotide sequence conserved 
among said complements; and 

(b) operatively linking for 
expression a plurality of different receptor-coding 
DNA homologs produced to a vector to form a 
plurality of different receptor-expression vectors. 



128 



38* The method of claim 37 wherein said 
first, second and third polynucleotide synthesis 
primers encode a predetermined restriction 
endonuclease recognition site. 

39. The method of claim 37 wherein said 
receptor-coding gene codes for a V H . 

40. The method of claim 37 wherein said 
receptor-coding gene codes for a V L . 

41. The library produced by the method 
of claim 39. 

42. The library produced by the method 
of claim 40. 

43. The method of claim 39 or 40 wherein 
said first polynucleotide synthesis primer 
hybridizes to a framework region nucleotide 
sequence. 

44. The method of claim 39 wherein said 
first polynucleotide synthesis primer hybridizes to 
a framework 3 region nucleotide sequence. 

45. The method of claim 39 or 40 wherein 
said first polynucleotide synthesis primer 
hybridizes to a J H region nucleotide sequence. 

46. The method of claim 39 wherein said 
first polynucleotide synthesis primer hybridizes to 
a hinge region nucleotide sequence. 

47. The method of claim 39 or 40 wherein 
said first polynucleotide synthesis primer 
hybridizes to a constant region nucleotide 
sequence. 

48. The method of claim 6 wherein said 
host cells express a plurality of different V H 
molecules, and said identified transformants 
express a Fab that binds said preselected ligand. 

49. A gene library comprising an 
isolated admixture of at least 10 3 different 
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conserved receptor-coding DNA homologs, a plurality 
of which share a conserved nucleotide sequence. 

50. The gene library of claim 49 wherein 
said homologs are individually operatively linked 
to an expression vector. 

51. The gene library of claim 50 wherein 
said homologs are individually present in a 
compatible host transformed therewith. 

52. The gene library comprising at least 
10 5 different coexpression vectors, each of said 
coexpression vectors being capable of expressing a 
heterodimeric receptor molecule comprising a 
combination of first and second polypeptides that 
is different from the combination of first and 
second polypeptides forming heterodimeric receptor 
molecules expressed by any other of said different 
coexpression vectors. 

53. The gene library of claim 52 wherein 
each of said coexpression vectors comprise a first 
polypeptide- and second polypeptide-coding DNA 
homolog operatively linked for dicistronic 
expression to a linear DNA expression vector. 

54. The gene library of claim 53 wherein 
said expression vector is lambda phage or a 
derivative thereof. 

55. The gene library of claim 52 wherein 
said first and second polypeptides are V H and V L 
polypeptides , respectively . 

56. A receptor-coding gene library 
produced by the method of claim 1. 

57. The gene library produced by the 
method of claim 2. 

58. A gene library comprising at least 
10 5 different receptor-coding DNA homologs, each of 
said homologs present as a population of DNA 
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strands wherein the ratio of the number of said 
strands of a first length to the number of said 
strands having a length other than said first 
length is at least 4:1. 

59. The gene library produced by the 
method of claim 37. 

60. The gene library produced by the 
method of claim 38. 

61. The method of claim 37 wherein said 
expression vector molecules are linear DNA 
expression vector molecules. 

62. The method of claim 61 wherein said 
linear DNA expression vector molecules are phage 
vector molecules. 

63. The bacterial expression vector 
Lambda Zap II V H . 

64. The bacterial expression vector 
Lambda Zap II V L . 

65. Novel coexpression vectors produced 
according to the method of claim 30. 

66. Gene libraries produced according to 
the method of claim 30 that have a plurality of 
different coexpression vectors. 

67. Novel monoclonal antibodies produced 
by the method of claim 35 having predetermined 
specificity. 

68. Novel isolated F v molecules produced 
by the process of claim 8 that are capable of 
binding a preselected ligand, wherein said V H and V L 
coding DNA sequences originate from different 
cells. 

69. Novel receptors produced according 
to the process of claim 1 that are capable of 
binding a preselected ligand. 
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70. Novel receptors produced according 
to the process of claim 6 that are capable of 
binding a preselected ligand. 

71. Novel F v molecules produced by the 
process of claim 35 that are capable of binding a 
preselected ligand. 

72. Transformed host cells produced 
according to claim 27. 

73. Novel polypeptide genes produced by 
the method of claim 14 that are capable of 
modulating the binding affinity of a preselected 
receptor. 

74. Novel F v molecules produced by the 
process of claim 14 that are capable of binding a 
preselected ligand. 

75. Transformed host cells produced 
according to claim 14. 

76. Novel catalytic receptors produced 
according to the method of claim 27. 

77. The use of a linear, double stranded 
DNA vector for randomly bringing together V H - and 
V L -coding DNA sequences, said vector having a V H - 
coding DNA sequence operably linked to a promoter 
and a site adapted so that the vector can be 
operably linked to a liner, double stranded DNA 
sequence having a V L -coding DNA sequence. 

78. The use of a linear, double stranded 
DNA vector for randomly bringing together V H - and 
V t -coding DNA sequences, said vector having a V L - 
coding DNA sequence operably linked to a promoter 
and a site adapted so that the vector can be 
operably linked to a linear, double stranded DNA 
sequence having a V H -coding DNA sequence. 

79. The use of claim 76 wherein said 
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site is found in both said vector having said V H - 
coding DNA sequence and in said linear double- 
stranded DNA sequence which has a V L -coding DNA 
sequence, and is positioned in said vector and said 
5 linear double stranded DNA sequence such that the 
vectors can be operably linked for coexpression of 
said V L -coding DNA sequence and said V H -coding DNA 
sequence. 

80. The use of claim 77 wherein said 
10 site is found in both said vector having said V L - 

coding DNA sequence and in said linear double- 
stranded DNA sequence which has a V H -coding DNA 
sequence, and is positioned in said vector and said 
linear double stranded DNA sequence such that the 
15 vectors can be operably linked for coexpression of 
said V L -coding DNA sequence and said V H -coding DNA 
sequence. 

81. A cleaved linear, double stranded 
DNA sequence vector containing a V H -DNA coding 

20 sequence selected from a V H -coding DNA sequence 

library, said vector having been cleaved so that a 
second DNA sequence comprising a V L -coding DNA 
sequence can be operably linked to it. 

82. A method of producing coexpression 
25 vector library capable of expressing V H and V L 

polypeptides from respective V H and V L genes, said 
V H and V L polypeptides being capable of forming a 
heterodimeric receptor, which method comprises: 

(a) synthesizing a V H -coding gene 

30 library containing a plurality of different VH- 
coding DNA homologs by: 

(i) separating the strands of 
a repertoire of V H -coding genes, said repertoire 
comprising double-stranded nucleic acids each 

35 containing a V H -coding strand annealed to a 
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complementary strand; 

(ii) treating said separated 
strands, under conditions suitable for polymerase 
chain reaction amplification, with first and second 
polynucleotide synthesis primers, each of said 
first primers having a nucleotide sequence capable 
of hybridizing to a sequence conserved among said 
V H -coding strands, and each of said second primers 
having a nucleotide sequence capable of hybridizing 
to a sequence conserved among said complementary 
strands, said primers being capable of priming the 
amplification of a plurality of different V H -coding 
DNA homologs from said V„-coding gene repertoire, 
said treating producing said V H -coding gene library; 

(b) synthesizing a second 
polypeptide-coding gene library containing a 
plurality of different V L -coding DNA homologs by: 

(i) separating the strands of 
a repertoire of V L -coding genes, said repertoire 
comprising double-stranded nucleic acids each 
containing a V L -coding strand annealed to a second 
complementary strand; 

(ii) treating said separated 
strands, under conditions suitable for polymerase 
chain reaction amplification, with third and fourth 
polynucleotide synthesis primers, each of said 
third primers having a nucleotide sequence capable 
of hybridizing to a sequence conserved among said 
V L -coding strands, and each of said fourth primers 
having a nucleotide sequence corresponding to a 
sequence conserved among said second complementary 
strands, said primers being capable of priming the 
amplification of a plurality of different V L -coding 
DNA homologs from said second V L -coding gene 
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repertoire, said treating producing said second 
polypeptide-coding gene library; 

(c) forming a diverse library of 
coexpression vectors by treating expression vector 
molecules adapted for ligation to the V H - and V L - 
coding DNA homologs of steps (a) (ii) and (b) (ii) , 
respectively, with a diverse plurality of said V H - 
coding DNA homologs and a diverse plurality of said 
V L polypeptide-coding DNA homologs, under conditions 
suitable for DNA ligation to produce a plurality of 
different coexpression vectors. 

83. The method of claim 82 wherein each 
of said different coexpression vectors being 
capable of expressing a heterodimeric receptor 
molecule comprising a combination of first and 
second polypeptides that is different from the 
combination of first and second polypeptides 
forming heterodimeric receptor molecules expressed 
by any other of said different coexpression 
vectors . 

84. A coexpression vector library 
produced by the method of claim 83. 

85. The coexpression library of claim 84 
wherein said expression vector moleculor are 
derived from lambda phase and said V H - and V L -coding 
DNA homologs are operatively linked to said 
expression vector molecules for dicistronic 
expression. 
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